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CACCTTGACT TGGGTGAGTA GGCACGGTAG GAGCTTTTCT CATAATCTCC TCTACCGTTG 3 60 

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AAGGTTCATT TGCAGGAGTG CGAACTACTG 420 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 4 80 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 660 

CTGGGAA 6 67 



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

GGGAAGCCAA GGTATTTTAT CGGATGAAGT TGTTACTAGT TCTTCACCGA TGGCTACAAA ' 60 

AGAGTCTTCT AATGCAATTA CTAATGATTT AGATAATTCA CCAACTGTTA ATCAGAATCG 120 

TTCTGCTGAA ATGATTGCCT CTAATTCAAC CACTAATGGT TTAGATAATT CGTTAAGTGT 180 

TAATAGTATC AGCTCTAATG GTACTATTCG TTCCAATTCA CAATTAGACA ACAGAACAGT 240 

TGAATCTACA GTAACATCTA CTAATGAAAA TAAGAGTTAT AAGGAAGATG TTATAAGTGA 300 

CAGAATTATC AAAAAAGAAT TTGAAGATAC TGCTTTAAGT GTAAAAGATT ATGGTGCGGT 3 60 

AGGTGATGGG ATTCATGATG ATCGACAAGC AATTCAAGAT GCAATAGATG CTGCAGCTCA 420 

AGGGCTAGGT GGAGGAAATG TATATTTTCC TGAAGGAACT TATTTAGTAA AAGAAATTGT 4 80 

TTTTTTAAAA AGTCATACAC ACTTAGAATT GAATGAGAAA GCTACAATTC TAAATGGTAT 540 

AAATATTAAG AATCACCCTT CCATTGTTTT TATGACAGGT TTATTTACGG ATGATGGTCC 600 

GCAAGTAGAA TGGGGCCCAA CAGAAGATAT TAGTTATTCT GGTGGTACGA TTGATATGAA 660 

CGGTGCTTTG AATGAAGAAG GAACTAAAGC AAAAAATCTA CCACTTATAA ATTCTTCAGG 720 

TGCATTTGCT ATTGGGAATT CAAATAACGT AACTATAAAA AATGTAACAT TCAAGGATAG 780 

TTATCAAGGG CATGCTATTC AAATTGCAGG TTCGAAAAAT GTATTAGTTG ATAATTCTCG 840 

TTTTCTTGGG CAAGCCTTAC CCAAAACGAT GAAGGATGGG CAAATCATAA GTAAGGAGAG 900 

CATTCAGATT GAACCATTAA CTAGAAAAGG TTTTCCTTAT GCCTTGAATG ATGATGGGAA 9 60 

AAAATCTGAA AATGTGACTA TTCAAAATTC CTATTTTGGC AAAAGTGATA AATCTGGGGA 1020 
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ATTAGTAACA GCAATTGGCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080 

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 1140 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTGTACA 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 12 60 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 1320 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483 



(2) INFORMATION FOR SEQ ID NO: 316: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



CCTGAACGCT 


TTTTTATAAA 


TATCATAAAG 


CCAATCTGAT 


TTATCAAGTG 


TGTCTAAGCG 


60 


ACGCGAATTA 


AAATTCATTG 


CATAPTCCAT 


CG CTTCTAAT*. 


AAACT C ATTT 


TTGAAAAGAC 


120 


GTTAAAATCA 


TCTAAATTCT 


GACTCCAATA 


TAATAACAAA 


ACCAATCCCA 


TAATATCCTC 


180 


TGGTTGATTA 


TTCAATAAAT 


TTAAGTTGGT 


TTCATAAAAC 


CCTGGAGTTC 


CAAATAGAGG 


240 


CAACTTTTTT 


TCTTCAATTT 


GAGTTTCTTT 


CCTTAGGGCA 


TGCTCAAAGT 


CTATAATATA 


300 


AATATTATTT 


CTATTATCAA 


TAAGTATATT 


ATTAAATGAT 


AAATCTCTAT 


AGGAAAGATT 


360 


ATATTTGGAG 


TTTATTATCT 


CCATATAATC 


AATTAATGTT 


AAAAACCAAT 


CATACGAGCC 


420 


ACTAACCATA 


TTATACTCGC 


TTAATTTATC 


TGCAATAATA 


AACTCAAATT 


CCACAAAATA 


480 


CGAATTCTTT 


ATGTAAAAAT 


CGTTAAAAAC 


TTTTGGAGTA 


AATTCCTCCT 


TTTCCAATTC 


540 


TACTAATATT 


TCTCTTTCAT 


TTATTAAACG 


ATTCACAGAA 


TCTCTATTTG 


TAAAATCAAC 


600 


CAACGATAAA 


TCACTAGCTT 


CTTTTAATAA 


AGAATAAACT 


CGCTTTTGAG 


TATTAAATAC 


660 


TTTATAAACT 


CCACCTTTGG 


CATTTTTAGA 


AATCACTTCC 


AAAATAATAT 


ATTGATCAGG 


720 


AATAGTGTTA 


TATCTTGGAA 


TATAGTAATC 


CCTTATTGGA 


ACATTCACAT 


TTGAAGGGAT 


780 


TTTCTTATCT 


CTTTTATCCT 


TGAAAGTGCT 


ATCTTTTACG 


AACTCCCCAT 


ATCTGTAATA 


840 


TACAACCTCG 


CTAAGTTGAA 


ATCTGAAATC 


TGATGGTATG 


TTTACACCCT 


TTACACCTTT 


900 
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ATACAATATT 


TCTAATTTGT 


GTAACAAACG 


TTGAAACTCT 


TTATTATCTT 


TTGGATAAAT 


960 


TGTAATGAAT 


TTCCCGACTT 


GTGAATAACC 


ATTAAGCCCT 


GTATTTTGCA 


AAGAAAGTTC 


1020 


TTTAATGCTA 


ACCAAAATTT 


TGAAATTTAT 


CTTCTTCTCT 


CTAGAAAATA 


TAAAATCAAA 


1080 


GAATTTTTTA 


GCAACCAAAT 


TAGCATTTAA 


TATTGAAGCG 


CTCAGGTGTA 


TTTTAAATCC 


1140 


CTTAGATTGG 


GTGATATTAG 


ACGGCAAATT 


ATATAACCAA 


TGTTCATCAC 


TAAAATTATC 


1200 


ACTAATTTTA 


TATTCTAATA 


ATAAATTATG 


GTATGCGTCT 


TCTATTTCAG 


TTTCATAGTC 


1260 


CAAATAGTTT 


AAATACTTTT 


CGTAATTCAT 


ATTAAGAAAT 


CTTCTCCATA 


AATTTTTAGA 


1320 


CCATCATTTA 


AAGCCAAACA 


ATTTAAAGCG 


TGATAATAAA 


TGTTGATAAT 


CAATGTAACT 


1380 


TTCAGTCCTC 


TATTTTGTAA 


TTCCTTCACC 


AATAATTTTA 


TGCTATATCT 


ATTTTCTCGA 


1440 


GGCAATTTAT 


AGGACTTCAA 


GATAAAACCA 


TAAAAGAGAT 


AAGTATTATA 


ATCTGACAAT 


1500 


CCAGTTTCAG 


AATAATTTTT 


TAGAAAAATA 


TCTAGTGATT 


CTGATAATTC 


ATCCGGAATA 


1560 


ATTCTTTTAA 


CATCGTATTT 


ATTTTTCATA 


TCGGCCACTC 


TTCCTTAAAA 


AGCTCACAAT 


1620 


AAAATTTTAA 


ATTTCTATAC 


AACAATCCGA 


GAGTAGTCTC 


ACAATTTGAA 


CATTTCACAT 


1680 


CACTCTTAAT 


ATATAAAAAA 


TGAATTAATC 


AGAAACCTCT 


GACTAAGATT 


TCCTAATTAA 


1740 


TTCACTTTCT 


ATATCATAGT 


AAGGAATTCT 


ATTATCCCTA 


ATTGAAAATT 


GAAATTTTAT 


1800 


GTTTTATATA 


TTAACAATTA 


TGCGGATTGT 


AAATCTTGTC 


TAACAAAATG 


GCAAGTGCTA 


1860 


CTATGTGCCC 


CAGAAGGCGA 


TGCAACGCTA 


TTTTGAATTG 


AAAGAGCATA 


ATCATCCATA 


1920 


TCATTTAAGT 


CACGGATTAG 


CAATGCTTCC 


TTCTCTCTTC 


CGACAATTCC 


AAATTTTCTA 


1980 


ATTACCTTTT 


CAGGATTATC 


AAAAAATTCT 


CCAACAACTT 


CCATATTTCC 


TTGAAGTTCA 


2040 


TTCAAGAAAG 


CTTTCATTTG 


ACTACTCATT 


ATATAGCTCC 


TTTTCTATTA 


CTTTATTTGG 


2100 


AATCAAAACT 


TACTTGTACA 


TTGGAAACAC 


CTCTATTCTA 


CGCTTTCATA 


TTGCTGCATG 


2160 


ACACTTTCAA 


AATCAAATTG 


CTAAAAATAA 


TTTTTTAAAG 


CTTAATTTAG 


ATTTAATTAC 


2220 


ATATATCTCA 


AAAAATTGTT 


TTGAAATTAG TAAATTAAAA 


TAGGTTTCTG 


TACTTATAGG 


2280 


AACTAGTTAT 


AAAAACTTCG 


CCCATCATAA 


AATATCTATT 


TAAGTAAAAC 


AAAAATTTTA 


2340 


TAATTTTTTG 


ATTTTTAAGT 


GACTATAATC 


TCCTATCTAT 


AAATACCATT 


CGCAGGACCT 


2400 


GGATCAATCC 


CTCTAGCCAT 


CTTATGAACT 


TGAGTTCCTC 


CAGACAGTCC 


CGG 


2453 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 



CCAATTTGAA 


GGCTCTAAAA 


CAATGGAAAA 


GTGCTACACA 


GATGTGACAG 


T^TTTGCCAT 


60 


TCCAGCAgTA 


CTCAAAAACT 


TTACTTATCA 


CCAGTTTTAG 


ATGGCTTTAA 


CAGCGAAATT 


120 


ATTGCTTTTA 


ATCTTTCTTG 


TTCGCCTAAT 


TTAGAATAAG 


TACAAACAAT 


GTTGGAACAG 


180 


GCATTCAAAG 


AGAAGCACTA 


TGAGAATACG 


ATTCTCCATA 


GTGACCAAGG 


CTGGCAATAC 


240 


CAACACGATT 


CTTATCATCG 


GTTCCTAGAG 


AGTAAGGGAA 


TTCAAGCATC 


CATGTCACGC 


300 


AAGGGCAACA 


GCCCAGACAA 


CGGCATGATG 


GAATCTTTCT 


TTGGCATTTT 


GAAATCGGAG 


360 


ATGTTTTATG 


GTTATGAGAA 


GAACTTTAGA 


TCTTTAGAAA 


ACCTTGAACA 


AGCTATTGTG 


420 


GACTACATTG 


ATTATTACAA 


CAACAAGAGA 


ATTAAGGTAA 


AGCTAAAAGG 


ACTTAGCCCT 


480 


GTGCAATACA 


GAACTAAATC 


CTTCGGATAA 


ATTAATTGTC 


TAACTTTTGG 


GGTGCAGTAC 


540 


ATTTTTGGTA 


TATATAAAAT 


TTGTAGGAGC 


TATATCTACA 


ATTTTATATT 


CCCAGTTTAT 


600 


GGATGTAACT 


TACTATATTC 


ACAATGTTAT 


CCAGTGTTTT 


TTCTCTAATA 


TTTAAGGAGT 


660 


GTTCTGTTTC 


TCGAATAAAT 


TCTTCAAAGT 


TTAACCCGTC 


AACTTGTTCC 


TGAACAAGAA 


720 


AATAATCATC 


CACGATATAA 


AATTCATCAG 


TTAAATTAGT 


AGTATAACTT 


TTATCGGCTA 


780 


ATTTTTTTAG 


CATGTGAGCT 


TCATTTT'PTA 


TATCATCAT'.G 




TCTCCTTCAG 


a40 


CATCATT^TT 


CACAAAAGGT 


CTTGACTGCT 


TGATGATTAC 


TTTTTGCCCG 


TCCGATTTTC 


900 


TAATTGCCCG 


ATAAACATTT 


CCTTTATTTG 


ATCTCTTAAT 


AATTTTTTCC 


ATTTTGTATT 


960 


TATTTATTGC 


AGAGTCCTTA 


CTTGAAACTT 


CACATGTGGT 


TTGAAAATAA 


ATCCTTTTTT 


1020 


CTTCTTCTGA 


AAATAAATCC 


ATTTTCCGG 








1049 



(2) INFORMATION FOR SEQ ID NO: 318: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
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CGAAGTATAA TGATATTGCC TTGCTTGCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 240 

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 300 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTGCT ATTTTTGTGA TTATGGCTCT 360 

TATTATGGGA ATATACCTAT GAATTGGGTT GTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAAGCATTA ATAATAAAGG TGTTAGAATT 480 

ATACGTGGTG GTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCAGA GATGATGTCA 720 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 776 



(2) INFORMATION FOR SEQ ID NO: 319: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TGCAATGCGG CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG ISO 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 240 

GAAAATGGCC GCTTTTCTGG ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 300 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 3 60 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGC AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

CCGGCTTAAT TTTTAGAAAA CGTGGGCAGG GAACCTTTGT TCTCTCTCGT GGCAGCTCAA 60 

AAAGAAAATT AATCGTTCCA GAAAGAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 120 

CTCATTCTAC AATTGACTCG AGGATTATTC ACTTCAAATT AGAATTTGCA AATGAATTTT 180 

TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 240 

TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 300 

GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAATTA CATTGAAGGA AAGTTAGGAT 3 60 

TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 420 

ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 4 80 

ATAACGGAAC TCCGTTTGAG TACTCGATTA GTCGTCATCG CTATGATTTA TTTGAATTTA 540 

ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 600 

CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAG GTCCCATCTA TGCTTAAATG 660 

GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 720 

A_AATATCTTC GTATAGCTT?; TAGAGTAGGT ACTGAAATTG TT-CaC(JTGAT CTACTTCTTA 780 

TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTGAGT 840 

CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA -900 

ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 960 

AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 1020 

AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 1080 

AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTG GATCATCGTA AATTGTTTGA 1140 

TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 1200 

AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 1260 

TATTGTTAGA AATCGATTTG ACTATCCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 1320 

TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC ACTTTCTTCA 1380 

TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAAGTTGAA 1440 

CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 1475 
(2) INFORMATION FOR SEQ ID NO: 321: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATAGTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTGAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 240 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 300 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 360 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 480 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 540 

TTAGCCCATA ATCTATAAAG 560 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 120 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 240 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 300 

TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 360 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 420 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 480 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 540 
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ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTG 600 

GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



GGTACCCACT 


CATTCTTGAT 


GAATTGTGAA 


CAGTTGCCCT 


TGGGTCGTTT 


TGCGAGTTGA 


60 


AGTCAAGAAG 


AGGAAAAAAA 


CAAAAAGGAG 


AAATACTCAT 


GGCAGTAATT 


TCAATGAAAC 


120 


AACTTCTTGA 


GGCTGGTGTA 


CACTTTGGTC 


ACCAAACTCG 


TCGCTGGAAT 


CCTAAGATGG 


180 


CTAAGTACAT 


CTTTACTGAA 


CGTAACGGAA 


TCCACGTTAT 


CGACTTGCAA 


CAAACTGTAA 


240 


AATACGCTGA 


CCAAGCATAC 


GACTTCATGC 


GTGATGCAGC 


AGCTAACGAT 


GCAGTTGTAT 


300 


TGTTCGTTGG 


TACTAAGAAA 


CAAGCAGCTG 


ATGCAGTTGC 


TGAAGAAGCA 


GTACGTTCAG 


360 


GTCAATACTT 


CATCAACCAC 


CGTTGGTTGG 


GTGGAACTCT 


TACAAACTGG 


GGAACAATCC 


420 


AAAAACGTAT 


CGCTCGTTTG 


AAAGAAATTA 


AACGTATGGA 




ACTTTCGAAG 


4uu 


TTCTTCCTAA 


GAAAGAAGTT 


GCACTTCTTA 


ACAAACAACG 


TGCGCGTCTT 


GAAAAATTCT 


540 


TGGGCGGTAT 


CGAAGATATG 


CCTCGTATCC 


CAGATGTGAT 


GTACGTAtTG 


ACCCACATAA 


600 


AGAGCAAATC 


GCTGTTAAAG 


AAGCTAAAAA 


ATTGGGAATC 


CCAGTTGTAG 


CGATGGTTGA 


660 


CACCAATACT 


GATCCAGATG 


ATATCGATGT 


AATCATCCCA 


GCTAACGATG 


ACGCTATCCG 


720 


TGCTGTTAAA 


TTGATCACAG 


CTAAATTGGC 


TGACGCTATT 


ATCGAAGGAC 


GTCAAGGTGT 


780 



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 60 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 120 
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AGGAAAAGCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180 

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCGTAT CTCTGCCTAA 3 60 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 



TCTTATGAAG 


CCGAAGCGTG 


ATTTATGGCG 


GATAGGTTTG 


GTCTGCAGAA 


AGTGACAAAT 


60 


CTAGTGCCAT 


CAGCGTATAT 


GGAATCTnTG 


GCTGAGAAAC 


AGTCCCGGGG 


TGAACTGACT 


120 


TATGAGCAGG 


TTTATGAGGA 


TGCAACGGCT 


TATCATCATA 


CCATTGATGC 


GAGTACAGAG 


180 


GAGGCAGACT 


TGGTTTCTCT 


ACGTATTGTA 


GAACTATTGT 


CTCGAAGAGG 


CTTTAGCTTC 


240 


AGTCCTGCGA 


TCTTACTTGC 


TATTCATAAG 


GAGTTGTTTC 


AAGATATATT 


TGAACCCTCG 


300 


ATTCCGGTAG 


GTCAATTTCG 


TCAGACTAAT 


ATCACAAAGA 


ATGAACCTGT 


tttgaatggt 


360 


GAAAGTGTTG 


TGTACTCTGA 


TTACTCCATG 


ATTCT^TGA 


CCTTGGATTA 


TGATTTTAAT 


420 


CAGGAAAAAC 


AAGTTGCATA 


TGCGACACTA 


ACCCAGGCGG 


ATATGGTTAA 


AAAAATCCAG 


480 


CATTTTATTT 


CAGGAATCTG 


GCAGATTCAT 


CCATTTCGCG 


AAGGAAACAC 


TCGGACGGTA 


540 


ACGGTATTTT 


TGATTCAGTA 


TCTTCGTGAG 


TTTGGTTTTG 


atattgat;^ 


TACACCATTT 


600 


CAGCAACATT 


CCAAGTATTT 


TCGTGATGCC 


TTAGTGTTAG 


ATAATGCAAA 


GATTTTACAG 


660 


CGACGTCCTG 


AGTTTTTAAC 


AGCTTTTTTT 


GAAAATCTCT 


tgctcggtgg 


TCAAAATGAT 


720 


TTGTCTTCAG 


AAAAAATGTA 


TCTAGATTTA 


GACCTCGATC 


tttcataatc 


CTAATACTGA 


780 


GTAAACATTG 


AATTTTAGGA 


AAAAATGAAG 


TAAATATTCT 


CACAAGAAAA 


CGTATATCAT 


840 


CAAAGTTTGG 


CTCTTTGTCA 


ATTGTAGTGG 


GTTGAAGAAA 


AGCTAAGTTC 


GAGAAAGGGC 


900 
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AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960 

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 

GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG 1237 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 
TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAriACTCA CTACTAGCTT .VwAACATGTT 240 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 300 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 3 60 

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 4 20 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 4 61 
(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
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ATAGAATTCA TAAAAGCCAT CGCTCATTAA AACAATTTGT TCACTAGTAA CATCTATTTG 240 

ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTAGAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 4 80 

TTTTTCTGAA AGTTTAGCAA ATATTGGTAA GCGATAATAT AGTATATTGA AACTAGAATA 540 

GTACACCTCT ACTTCTA7VAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTGATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTGAT TTTGTGAACC 840 

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 960 

GTTCCATAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 1020 

TAGATTTATT AATATGATTT TCAGTTTCTC TGCCATCTCC AACTATTTTA TAGTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT TTTTCGTTTT TATCACTTAA ATTATTTGTT ACAATTTTGT 1260 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACAGGTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 1380 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 1436 



{2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STFANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60 

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGGAGATA TTCGATTTGC 120 

CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180 



wo 98/18931 



PCT/US97/19588 



1353 

GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG 240 

TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 420 

ATGCTTGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 480 

TACGTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 540 

TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TGCCCATCAC 600 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 646 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 



GTTGCAGGTG 


CAGTAGGTGT 


TACTTCAGAT 


ACATTTGAAC 


GTGCAGAGGC 


TCTTTTTGAG 


60 


GCAGGAGCGG 


ATGCGATTGT 


TAT-TGAtaCt 


GCA(JA'lt;tiTC 


ATTCTGCAGG 


TGTCTTGCGT 


120 


AAAATTGCCG 


AGATTCGTGC 


TCATTTCCCA 


GATCGGACTT 


TGATTGCTGG 


AAATATTGCT 


180 


ACTGCTGAAG 


GTGCACGTGC 


CCTTTATGAA 


GCGGGTGTAG 


ACGTTGTTAA 


GGTTGGTATT 


240 


GGACCAGGTT 


CTATCTGTAC 


TACTCGTGTG 


ATTGCTGGTG 


TTGGTGTTCC 


GCAAGTAACA 


300 


GCTATCTACG 


ATGCTGCAGC 


TGTTGCGCGC 


GAATATGGTA 


AAACGATTAT 


TGCTGACGGT 


360 


GGGATCAAGT 


ATTCTGGAGA 


TATTGTAAAA 


GCACTTGCTG 


CAGGTGGAAA 


TGCTGTTATG 


420 


CTTGGATCTA 


TGTTTGCTGG 


AACTGATGAA 


GCTCCAGGCG 


AAACTGAAAT 


CTTCCAAGGA 


480 


CGTAAATTCA 


AGACTTACCG 


TGGTATGGGA 


TCAATTGCTG 


CTATGAAGAA 


AGGTTCAAGC 


540 


GACCGTTATT 


TCCAAGGTTC 


TGTCAATGAA 


GCAAACAAGC 


TTGTTCCAGA 


AGGAATTGAA 


600 


GGTCGTGTTG 


CTTATAAAGG 


AGCGGCAGCT 


GATATTGTTT 


TCCAAATGAT 


TGGTGGTATT 


660 


CGCTCTGGTA 


TGGGTTACTG 


TGGTGCAGCT 


AACCTTAAAG 


AACTACACGA 


TAATGCTCAA 


720 


TTTATTGAAA 


TGTCTGGTGC 


TGGTTTGAAA 


GAAAGCCATC 


CTCATGATGT 


GCAAATTACT 


780 


AATGAGGCAC 


CAAATTATTC 


TATGTAAAAA 


ACAATGAAAA 


GAACTCCAGT 


GAAAACAGGA 


840 


GTTCTTTTAC 


AATGTTGTCA 


ATTTCCATTT 


ACAGCAGCTT 


TACCATCCTG 


AATAGTGAAG 


900 
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ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC TTGTTGTTGT GATAAAGGTT 9 60 

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTGTC TAGTGTTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 1140 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TGAAAGATTC GGATAAGTTT 1260 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGGATATTG ACAACTCTTC GATCTGATTA 1320 

GAGAGTTCAA AATGTTTCTT ACGCCCAAAT GATTCTAGTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATGTT 1440 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAAGT CAGATAAATA GATTGGCTTA ATTTGCCCAA GTTCCATATC AATGAATTTT 1560 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 162 0 

TTCATGTGTC CTACATAATC TGAAAGGCGT GCC 1653 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA CGTTTTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 24 0 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 3 00 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 3 60 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 420 

TATGCACATA TACACTTAAA TTCATATAAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
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CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720 

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 780 

AAATGATCAT CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 840 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 960 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAG TTTGATACGG ATTAAACTTT 1200 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CAT ACT AT AA 12 50 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 1320 

CCTCCTTCAT AAAACCGGTA 1340 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTCTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 360 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 480 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 607 
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(2) INFORMATION FOR SEQ ID NO: 332: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



TTAAAATACC 


GAATTTTGTT 


TTGTCCTCTA 


TTTCAACATT 


GTGAATCGCC 


TCAGGCAGAG 


60 


AACCGATACT 


AAAGATATAA 


CCAAAATAGT 


TGTCATTTGC 


TTTACCGATA 


TCAATCTTAT 


120 


TGGTTAAATC 


AAAATCCAGT 


TCGTCAATTG 


CGCCATCGAT 


GTCTTGATTG 


ATTTCCAAAA 


180 


GTTTTGTAAT 


GAGGTTACCC 


GTACCGCCTG 


GGATAATCCC 


TAACTTAGGA 


ATGTAGTCTC 


240 


TCTCATCAAT 


ACCTGAAATG 


ACTTCATTGA 


CAGTTCCATC 


TCCACCAAAC 


ACAACCACTG 


300 


CATCATACTG 


CTCACGAGAA 


GCTTCTTCAG 


CAAAATGTGT 


TGCATCCAGC 


GCTTTTTCGG 


360 


TAATTTTGGT 


TTCAACATAT 


TCAAAGTATT 


CTTTTGCTTT 


ATTCTCCAGC 


TTTTCTTTGT 


420 


AATCCAAAGC 


CTTCTCGCCA 


CCAGAAGTAG 


GGTTGATAAT 


TACCATTGCT 


TTTTTCATTG 


480 


ATTTTATCCT 


TAATTTTAAA 


CAGAAATGTT 


TACATTTCGT 


CGTATGCAAG 


TAAATGTAAT 


540 


CCTATTATAC 


AATGAAAATA 


CAGAAAAGAG 


AAATCTGACG 


TACTGGAGAT 


TAATACGCTT 


600 


TTATTCTATT 


TTCCCATCGC 


CTAACTACAT 


CCTTTAAGGG 


TTCATCCAAG 


TAAGAATAGG 


660 


CCTTATCCTT 


GATCCAATCA 


GGAATACCGT 


AAGCTGCCTC 


TGCTAwGCTA 


CAAGTGATTG 


720 


CTGCGAGAGT 


ATCACTGTCG 


CCACCAAGTG 


AGATGGCATT 


TCTTATCGCA 


TCTTCGAAGT 


780 


CTCTACTTTC 


AAGAAAGGCG 


ATAATGGCTT 


GAGGGACAGT 


TTCCTGACAT 


GTTTCGTTAA 


840 


AACGATAGTT 


AGGACGGATT 


TCATCTAAAG 


TTTGAGATAG 


ATTGTAATCG 


TATTCTTTTT 


900 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
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GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCGC TGGTGAGTTC 240 

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTGG CCTACACTCT CAGAAAGAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACCG TAGAAATGGC 420 

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480 

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 533 

(2) INFORMATION FOR SEQ ID NO: 334: 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 544 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGCAGA TAGAACTGAC GAAGTCAjinA ACCACACCTA CGGCAAAGTG AATCTGAAGT 24 u 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 360 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 480 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 
ATGA 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



544 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 



60 
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CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTTAGAATAT 120 

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 349 



(2) INFORMATION FOR SEQ ID NO: 33 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

CTCCGATAAC CACACCAGCA ATGGAAATAA TTCCATCGTT AGCATCAAGA ACACCCGCAC 60 

GCAGGATATT TAAACGACCT GCAAAATTTG AATCAATTTC GTGATTTGTT TCTGACGCTA 120 

AATTTCAAGT TCAAGTTAGC CATCAAGAAG TCTTCTCTGG GTGACTTGTA GTCCAAGCAT 180 

TTTTTAGGAT AGTTGTTAAT CCACTTTTCG ATGAATGCGA CTTCTTTGGG AGTCATTTTC 240 

TTGGTTCCCT TAGGTAACCA TCTACGAATG AGCCTGTTGT GATTCTCATT AGTTCCCGGG 3 00 

ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTGGCACT GGCCGTCGTT TTACAACGTC 3 60 

GATGACTGGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 420 

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 480 

CCTGAATGGC GAATGGGGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC 540 

ACACCGCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC 600 

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 660 

TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAAGTTTT CACCGTCA'J'C 720 

ACCGAAACGC GCGAAACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT 780 

GATAAGGATG GTTTCTTAGA CGTCAAGTGG CACTTATCGG GGAAATGTGC GCCGAGACCC 840 

TATTTGTTTA TTTGTCTAAA TACATTCAAA TATGTATCCG CTCGTGAGAA AATAAACCTG 900 

ATAAATGCGT CAATAATATT GAAAAATGAA GAGTATGAGT ATTCTACATT TCCGTGTCGC 960 

CCTTATACCC TTTTTTGCGG CATGTTGCCT TCCTGTTTTT GCTCACCCAG AAAACGCTGG 1020 

TGAAAGTTTA AGATGCTGAA AAATCATTTG GGTGCACT^C TGGGGTTACA TCCAACTGGA 1080 

ATCTCCAnCA GCAGTTAAGA TCCTCTGACA GTTGTACACG CCGCAAGAAC TATTCCCGAT 1140 



wo 98/18931 



PCT/US97/19588 



1359 

GAATGAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200 
GCCCGG 1206 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



CTGCTCAACT 


CAGACAGTCA 


AATTTCTGAC 


TTTACCAAAA 


GAACCATCAA 


AAAAGTTGCT 


60 


GAAAAAGGCC 


ATCAGGTTAT 


TATTACGACA 


GGTCGCCCTT 


ACCGTATGTC 


AAAAGATTTT 


120 


TACCGTGAAC 


TGGGCTTAGA 


CACTCCTATG 


ATTAACTTCA 


ACGGATCCCT 


TACTCATTTA 


180 


CCAGACCAAG 


TTTGGGATTT 


TGAAAAGTGT 


TTGACTGTAG 


ACAAAAAATA 


TCTGCTAGAT 


240 


ATGGTTCAAC 


GTTCAGAGGA 


CATTCAAGCC 


GATTTTATCG 


CTGGAGAATA 


TCGTAAAAAA 


300 


TTCTACATTA 


CAAATCCCAA 


TGAAGAAATT 


GCCAATCCCA 


AACTATTTGG 


TGTAGAAGCT 


360 


TTCCAGCCTG 


AAGATCAATT 


CCAGCCTGAA 


TTGGTGACCA 


AGGACCCTAA 


CTGTATCCTC 


420 




GAGCCAGTGA 


CAAATATTCC 


TTGCiUAAAAU 


AAATGAACGC 


CTTCTACCAG 


480 


CATCAACTTT 


CTATCAATAC 


CTGGGGAGGT 


CCGCTCAATA 


TCCTTGAATG 


TACCCCAAAA 


540 


GGTGTCAACA 


AGGCCTTTGC 


TTTGGACTAC 


TTGCTCAAGA 


TAATGAATCG 


TGACAAAAAA 


600 


GATTTGATTG 


CCTTTGGAGA 


TGAACACAAT 


GATACCGAAA 


TGCTCGCTTT 


TGCTGGGAAG 


660 


GGTTATGCCA 


TGAAAAATGC 


CAATCCAGAG 


CTACTCCCTT 


ATGCAGATGA 


GCAAATTTCC 


720 


CTTACCAACG 


ACCAAGATGG 


GGTTGCCAAA 


ACCCTACAAG 


ACTTATTCTT 


ATAACCTATA 


780 


CTGATACTCA 


ATGAGGGGCA 


AAGAGCGAAC 


TTA 






813 



(2) INFORMATION FOR SEQ ID NO: 338: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
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ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACGTTCG TACAGAGGTT CTCGCTGGTT 120 

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CGACGATTAT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAACC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683 



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

CTACTTTACA TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60 

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120 

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180 

ATGgACAGAC AATCATATCT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 240 

TTATTTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 300 

CGGGGCAATA TCTAGGCACA GGCTTACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 360 

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT ATCTATTTAG 420 

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 480 

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT TACATTGCTG ACAATTGCGT 54 0 

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600 

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCGAG ATTAGTCGGG 660 

TGTTATCCTC TATTCCGTTA ATATTCGAGA CAATTGAAAA ATACGAGCGA ATCATTGTGC 720 

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 780 
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TTCTGATCGT GTAGATTTTT TTGTTTCACT AGGGATTTAG CCCGAGCTCA AATCAGCTCT 840 
CTGATTTTCA OA 852 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

CCGCACAAAA GCGCATAGTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG 60 

GATAAATAGT TAGTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120 

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACATCA TAACAACAAG 180 

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATGG ACATAGTAAT TGAGATAACT 240 

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTAGGGT 300 

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATAGC 3 60 

AAAGAGGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 420 

ATCTTTTGTC CAATAATGAG CAAGTAAAGC CAAAATCATC TGACCAA_ATA AGATCAA.V'J^. 4SC 

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTGACTA GGAGACTTAG CATCTGATGG 540 

GAAATAAGTC CACGACTCTT TTCGACGCCA TAAGCCTTGT TAAAAGCTTT TTGCAAGAAA 600 

TTCATAGATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660 

GAAGGTTGCG TCAAGACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTGGGGGG 720 

CAGACGTCTT TCATAAAGCC CAAAAATTCT CCCA 754 



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 60 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 120 
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CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180 

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 360 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480 

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 660 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 707 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GGATTTTGAA AAACCATACC GATTTGACGA CGTATATTCC AAACATTTTC CTCAGTCAAA 60 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACCGTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACGTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTGATT TTCTTTATAG CGAAAAGAAA 240 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300 

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 3 60 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACTA.'UV 420 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 4 80 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 540 

AGCCATACCT AACTCAATCA ACATAATAAA AGCCGACATA ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GATAATAGCT GCAACTATAT GACTCTCTAT 720 

CGAATTTCCT ATCGTTAAAA TAAAGATAAA AATAGGTATA AA 7 62 
(2) INFORMATION FOR SEQ ID NO: 343: 



wo 98/18931 



PCT/US97/19588 



1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 82 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTCAGC GTTTTGCTTC 60 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 24 0 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 360 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACTGGGCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 

TC 482 



(2) INFORMATION FOR SEQ ID NO: 344: 

' i ^ S E'^UEMCE C HAPJiCTEFv I ST I C G : 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATGGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 520 
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(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



GCATCAAATC 


CGCCATCAAA 


GAAGTTCTCT 


GGATTTACCA 


AGACCAGTCA 


AATAGCTTAG 


60 


AAGTGCTTAA 


TGACAAGTAC 


AATGTTCACT 


ACTGGAATGA 


CTGGGAAGTT 


GGAGACACGG 


120 


GAACCATTGG 


TGAGCGCTAT 


GGTGCCGTTG 


TTAAGAAACA 


CGACATTATC 


AATAAGCTTC 


180 


TCAAACAGTT 


GGAAACCAAT 


CCTTGGAACC 


GCCGCAATAT 


TATTTCGCTC 


TGGGATTACC 


240 


AAGCTTTCGA 


AGAAACAGAT 


GGGCTGCTCC 


CGTGCGCCTT 


TCAGACCATG 


TTTGATGTTC 


300 


GGCGTGTTGA 


TGGGGAAATC 


TATCTGGATG 


CGACCTTGAC 


CCAGCGCTCC 


AATGATATGC 


360 


TGGTGGCCCA 


CCACATCAAC 


GCTATGCAGT 


ATGTGGCTTT 


GCAGATGATG 


ATTGCCAAAC 


420 


ATTTTGGCTG 


GAAGGTTGGG 


AAGTTCTTCT 


ACTTCATCAA 


CAACCTCCAT 


ATCTATGATA 


480 


ATCAATTTGA 


ACAAGCTCAG 


GAATTGCTCC 


GTCGGGAgCC 


GTCAAACTGC 


CAACCACGCT 


540 


TGGTTTTAAA 


TGTTCCTGAT 


GGGACTAATT 


TCTTTGATAT 


CAAAGCAGAA 


GATTTTGAGT 


600 


TGGTGGATTA 


TGACCCTGTT 


AAGCCACAGT 


TGAAGTTTGA 


CCTAGCTATT 


TAAAAGAATA 


660 


GAAAAAAGAA 


GTTGAGAATA 


ATCCCAACTT 


CTTTTGTTTC 


TTAACGTGAT 


ACGCGGCGAC 


720 


GAGCTGCTTT 


TTTACGGTTT 


TCTTCGATGA 


AAGCTGCTTT 


TTGCTCTTCT 


GGTTCGATTA 


780 


CTTTCTTTTT 


AAATGCGTAT 


ACTGCACCTG 


CAACGGCAGC 


GACAGTTCCT 


GCGACACCTG 


840 


TTACAAGACC 


TTTAGCGAAT 


CCTTTAGCCA 


TGAGTCTTCC 


TCCTTTATAT 


TCTCAATCAG 


900 


CCAGCCTCCT 


CAAGAGGTCA 


CATTTTTCTG 


ACTGACCTTT 


TTGTGTTATA 


ATAATAGTAA 


960 


CGAAAAAATG 


GGAATTTTTC 


AAGGAAAAAA 


GATGAGAACA 


AAA 




1003 



(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGGTC 
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TCAACGCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CGCCATGCTA 120 

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGAGCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

CCCCATATTG GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420 

AAAACTGCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTGTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 

GTCACGGCTG ACAAAGACTT TTTCTGAAGA 750 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOfiy .* 1 ine€ir 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 240 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 300 

TACGAAGAAG GTCGATAGAT GTCAkTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 360 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
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{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CAGAGTCAAC AGCCTGAGTT GAAGGCAACT TTAGACACAG CAGTTACGAC AGCTGAATGA 60 

GCTCCTCCAT CAGTTTTTTC TTTAATGAGT CCAGCTACAT CTTCAACTTC GAGGCCGTTA 120 

ATCACAATGT CAGCGCCTAC TTCTTTTGCA AGGGCAAGTT TGTCATTGTT GATATCGACT 180 

GCGATAACAT GAGCATTGAA TACTTTTTTA GCGTATTGAA CAGCGAGGTT ACCAAGTCCA 240 

CCAGCACCGT AAAGAACAAC CCATTGGCCT GGTTCAACTT TTGCTTCTTT GATAGCTTTA 300 

TAGGTTGTTA CTCCAGCACA TGTGATAGAA GAAGCTTGGG CTGGATCAAG TCCGTCAGGA 360 

ACTTTGACAG CATAGTCAGC AGTTACGATA CATTCTTCAG CCATACCACC GTCTACTGAG 420 

TAGCCAGCAT TTTTCACTGT ACGGCAAAGG GTTTCGCGAC CAGTTGTACA GTATTCGCAA 480 

GTGCCACATC CTTCAAAGAA CCAAGCAACG CTGACGCGGT CACCGACTTT AAGGCTTTTC 540 

ACATCTGGAG CAATCTCTTT AACGATACCG ATACCTTCGT GCCCAAGAAC ACGTCCTGGG 600 

ACTTGACCAA AGTCACCATG AGCAACGTGG AGGTCGGTGT GGCAAACGCC CACAGTATTC 660 

ACTTCTACAA GTG 673 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 60 

TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 120 

GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 180 

TCCCTCACCA AGATTCCG 198 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATAGC TAGAAGGAGG 60 

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120 

CTGGCACTGA TAACGGCAAC CACCAAAGGA GAAACCAAGG ACACAGCCTT CATGGAAATG 180 

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG ATAATGAGGC AAATCAAGAT ACTCCAAATA 240 

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 300 

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 360 

ATAACATTAA ACATAACACC CATGGCACTC AGCAAACCTG TTATAAGAGC TAGCGGCGTC 42 0 

ATGGATAACT GAGAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACCCAG CATGGCAACC 480 

AAAACATAGA AAACAGCGCT TTTTGACGCT CGTTTTTGAT AAACCAAGCG ATTGTAAAAG 540 

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 600 

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 660 

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTGCA 720 

GACCAAATGA GTACAAGACT CCCTGCCAOT GTCAAACGCA TACACGTAAT CCAGCCCGAA vao 

GACACCTGAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACAGATTCC CCATATTAAG 840 

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891 



f2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 300 
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ATGCGTTTGA CTGTCAAATG GAGTG 325 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA /UlATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180 

AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 240 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 
\CJ STRAIvDEDriESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 



CCCTATCCCT 


GCTATTGGGG 


CTGCTCTCAT 


TGCTGCTTTG 


GCACAAATCA 


GTCTTCCAAT 


60 


TGGACCTGTT 


CCCTTCACTC 


TGCAAAACTT 


TGCAATCGGC 


TTGATTCTAC 


TGTCTTTAGA 


120 


CCGAGAGAGG 


CTGTACTTTC 


TGCTGGACTC 


TATCTTCTTC 


TAGGTGCTAT 


CGGTCTTCCT 


180 


GTCTTTGCAG 


GAGGTGGAGC 


TGGTTTTCAG 


GCTTTAGTTG 


GCCCTACTGC 


AGGCTATCTT 


240 


TGGTTTTATC 


TCGTTTACTC 


TGGACTTACT 


TCCTCTCTAA 


CCAACAGCAA 


GAGTGGTGTT 


300 


GTTAAGATTT 


TTCTTGCAAA 


CCTCTTGGGT 


GATGCCCTTG 


TCTTTGTCGG 


CGGGATTCTC 


360 


AGCTTGCATT 


TCCTAGCTGG 


AATGGCATTT 


GAAAAAGCTC 


TTGCTGTGGG 


GGTTCTTCCC 


420 


TTTATCATTC 


CAGACCTTGG 


CAAACTTCTA 


GCTATTAGTT 


TTATTAGCCG 


TCCCCTACTT 


480 


CAACGCCTTA 


AAAATCAGGC 


TTACTTTACT 


AACTAAAAAA 


GGATATCGAG 


TTATCATGAC 


540 


TCAATATCCT 


TTTCTTTTAT 


TTTGAAAACT 


TATACTCAAT 


GAAAATCAAA 


GAGCAAACTA 


600 


GGAAGCTAGC 


CGCAGGCTnG 


CAAAACACTG 


TTTTGAGGTT 


GTGGATGAAA 


CTGACGAGTA 


660 
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AnATCTCATA CATACGGCAA GGCAAAGCTG AC 692 

(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60 

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120 

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180 

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240 

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 300 

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 3 60 

TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 420 

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 480 

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA A^AGTOATTCA ACTATATCCC 540 

TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600 

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 660 

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 720 

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 780 

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 840 

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900 

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 960 

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS I double 

(D) TOPOLOGY: linear 
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Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 
TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 
AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGrTTGTTCC TATGAGCTTG GACGTTACCA 
AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 
TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 
ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 
TTACTATAAT GGCAAGGnC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 
TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 
TAAGCTAAAC GGTAAATACT ATGTTrACCT TAAGGATGCA GCTCATGCGG ATAATATTCG 
GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 
AGATAATGCT GTTGCKKTAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 
CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 
CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 
CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTOATA ATGCAAATCC 
AGCTCAACCA AGArTCTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 
TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 
CATGTGGGAT CTG 

(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 
GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 
TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 
CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 
GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTT<SATTT ACTGTCGTTA 
TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 
GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
973 



60 
120 
180 
240 
300 

360 



PCT/US97/19588 



WO 98/18931 



CC».»C« CO.'mCA" C«C,=C«™ 
""""" " 

„c — . ^c.- 

— " 

„ ^--^ — „ — 

TCG 

,D) TOPOLOGY: linear 



420 

480 

540 

600 

660 

720 

T80 

840 

843 



SEQUBNCK OESCBimON: Q^^^ ^^^^^^^^ 

"„,CCC.C^ .<.-C..C. 
-^^^^^^ .TCC...^ 
,„;.CT.OC ^CTCOV^CX T.C^-C ^^^^^^ ^^^^^^ 

— : — — 

3....^.^ — -^^^ ^^^^^e, .^ccc. ..c.^- 

„o .~ ^^.eccc ^c-- 

^^o...oo ^..coc^'^ ^^.^ 

— ° 0^--^ ^^^^^ ..^<^c ccc^- 

^OCCOTCC C<^--Ga 
^orrCKGCTA AGOCCTTCGT 
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(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGGCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 300 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 360 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 480 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGT-TAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA Gu3 
{2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT GACCAGTCAG GAAACAGATG ISO 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 300 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 360 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 
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TCAACCAAGT AGOAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 
CTACGGATAA GGCTCAAA<^ GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 
;VGGTCGATAC TGTCAGCACA GTGAriGATG TGGCGACTTG TGA^rrCAAG GAATTGCACC 
CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE C3IARACTERISTICS : 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



480 
540 

600 
641 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 
CCTCAAGGCC AArTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGT.3ACAGAA 
rrTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 
AGCGAAATTA TTGCTTTTAA TCTTTCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 
rrAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 
_...,n..TC A?-CGAT?C TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 
li^TC^CGCA AGGGCAACAG CCAAGACAAC GGTATGATGC AATCTTTCTT TGGCATrr^A 
A^ATCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA ATTGGAACAA 
GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAAGGTAAA ACTAAAAGGA 
CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTCTCT AACTTTTTGG 
OGTCAGTACA AAACTCTTGC TACTATGCGT TTTATTATTG AAAGACTTAT TGGACTTTCT 
CTCAAATCGA GWTTPACTC AATTrrCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 
CTCTGAGTAG AGTGTCTTGA TATT^GCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 
CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTTGTTTGA 
GAT^AGGATT TCATAGTGAA GCGGAGCTO3 GGTAAAAATA ACATCTGTAT TCCCTGCAGA 
ATAAACCTCA ACAAGGGTI^ CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 
^TTGTC GTATTGATAA GCTTCATAAT ATTOXTTCCG ATTTTCTAAT TCTATTATAG 
CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 
GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAAT^TTATT 
TTCGAAGAGT ATTAGCCAAT CTTATGCTC^ TTT^TCCAAG ATTCAATGGC CCATTTATGG 



60 

120 

180 

240 

300 

360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



wo 98/18931 



PCT/US97/19588 



1374 

CTACCACGTT TAAOJI^TTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT A^^ATTAAAG 
TTTTCTAGTG GCTTTTGTAC TTCTTTGAAA GGAGTTCCTT CATAGAGGTA GGCAGGATTG 
TAGTAGTAGG TATCACGATG ACGAGAATAG AAATArTCGT CAGCTTGTCC GTAATAGGTA 

CCAArrrCTG ctgtgaaacc aagctcttca atcaactcat gctttagggc ttcctgatga 
TrrrCACCTG cttcaatttc tccacatggt aggaaccaag caccatttgg ttcttgaaca 

AGAACAATTT GTTTTTGTTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 
TAGTCTGTAT TCACTTTTOT TCTCCGAAAG TTGGGTTTGC CATTGCArTT TCCTCATTAT 
CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 
TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGnHTTC TCTTATTGTG AGCGAACAGG 
TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCA.TT ATGGCTGGTT 
AAAGCTAAAA GAGAAAACAG GAGAGCPAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 
GGrrGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCrT ATT^ACTGA 

;^TAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 

(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 
CTTAGAGAAG CTTTCAAATA AATTTGGGAT An^CAATTCT AATCTTAGGT ATAO^TTAA 
ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 
TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 
TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 
GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAA«5 
CCATCCTAAA AAACTTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 
AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 
ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 
AGAGCTTAAA GCTGAAATTC AATCCArTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 
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TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660 

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 

851 

GCTTTACTTA T 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GGGTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 

GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA 3 00 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 3 60 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAAGACTTGT 420 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGTTTTTTT 480 

CTACACTTAT ACCATAAAGG GGAAACTCTT TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 900 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 960 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 1020 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 1080 



"I 
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ATCrrCTTCC AGATACrrrC CCTCrrCTTA ACTGACCTTT TAA1..AGCGA CCATA^CTC 1140 

1168 

GATAAAAATA AGTATCGAAT CCTGTTTC 
(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIOGV: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
GTCAGCrrCA GCAAGCCCAT CAGCTTCTGA ATCTGCA-K:A ACCAGTGCGT CCGCTTCAGC 
GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 
GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 
TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 
GAOTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 
ATCTGCATCA ACCAGTGCCT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 
CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 
ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 
ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 
AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 
CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 
W.GTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 
ATCAGCTTCA GCATCAACGA CTGCATCGGC l^CGGCGTCA ACCAGTGCAT CAGAGTCAGC 
AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 
GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 
AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 
CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 
ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 
ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 
OTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 
CTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 
ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 
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TCrGCTTCAG 


CAAGTACCAG 


TGCGTCAGCC 


TCAGCAAGTA 


CCAGTGCkTC 


AGCCTCAGCG 


1380 


TCGACAAGTG 


CGTCGGCCTC 


AACCAGTGCA 


TCTGAATCGG 


CATCAACCAG 


TGCGTCAGCC 


1440 


TCAGCAAGTA 


CTAGCGCCTC 


AGCCTCAGCA 


TCAACGAGTG 


CGTCCGCTTC 


AGCAAGTACT 


1500 


AGTGCATCAG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCCTCAG 


CGTCGACAAG 


CGCCTCAGCT 


1560 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCG 


TCGACAAGTG 


CGTCGGCTTC 


AGCAAGTACC 


1620 


TCAGCGTCTG 


AATCAGCATC 


AACAAGTGCG 


TCGGCTTCAG 


CATCAACGAG 


TGCATCAGCT 


1680 


TCAGCATCAA 


CAAGTGCTTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


1740 


AGTGCTTCAG 


TCTCAGCGTC 


AACCAGTGCC 


TCTGAATCCG 


CATCAACAAG 


TGCCTCGGCT 


1800 


TCAGCAAGCA 


CCAGTGCTTC 


GGCTTCAGCG 


TCAACGAGTG 


CGTCTGAGTC 


AGCATCAACG 


1660 


AGTGCGTCAC 


CTCAGCAAGC 


ACATCAGCTT 


CTGAATCTGC 


ATCAACCAGT 


GCGTCACTTC 


1920 


CGCATCAACA 


AGCGCCTCGG 


CCTCAGCAAG 


TACAAGTGCT 


TCAGCCTCAG 


CATCAACCAG 


1980 


TGCATCAGCT 


TCAGCCTCAA 


CAAGTGCTTC 


AGCCTCAGCG 


TCAACGAGTG 


CGTCGGCTTC 


2040 


AGCAAGTACC 


AGTGCGTCAG 


CTTCAGCAAG 


CACAAGTGCG 


TCAGCTTCAG 


CATCAACCAG 


2100 


TGCTTCGGCT 


TCGGCATCAA 


CAAGTGCCTC 


AGCATCAGCA 


TCAACGAGTG 


CGTCAsCTCA 


2160 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACCAGTGCAT 


CAGCCTCAGC 


AAGTATCTCA 


2220 


GCGT'CtGAa'I' 


CUGCATCAAC 


GAGTGCATCA 


GCATCAGCAT 


CAACGAGTGC 


ATCGGCTTCA 


2280 


GCGTCAACCA 


GTGCATCAGT 


CTCAGCAAGC 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACCAGT 


2340 


GCCTCAGCCT 


CAGCAAGTAT 


CTCAGCGTCT 


GAATCGGCAT 


CAACGAGTGC 


GTCAGCCTCA 


2400 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACGAGTGCAT 


CGGCTTCAGC 


AAGTACCAGC 


2460 


GCCTCAGCTT 


CAGCAAGCAC 


CAGTGCGTCA 


GCCTCAGCAA 


GTACCAGCGC 


GTCAGCCTCA 


2520 


GCAAGCACCA 


GTGCCTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CAGCCTCAGC 


GTCGACAAGT 


2580 


GCGTCGGCTT 


CAGCAAGTAC 


CTCAGCGTCT 


GAATCAGCAT 


CAACGAGTGC 


ATCAGCTTCA 


2640 


GCATCAACAA 


GTGCTTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACCAGT 


2700 


GCTTCAGTCT 


CAGCGTCAAC 


CAGTGCCTCT 


GAATCAGCAT 


CAACAAGTGC 


CTCGGCTTCA 


2760 


GCAAGCACCA 


GTGCGTCGGC 


TTCAGCAAGT 


ACTAGTGCAT 


CGGCTTCAGC 


ATCGACAAGT 


2820 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCAGCCTCA 


2880 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CGGCTTCAGC 


GTCAACCAGT 


2940 


GCGTCGGCTT 


CAGCGTCGAC 


AAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCGGCCTCA 


3000 


GCAAGCGCAA 


GTACCTCAGC 


GTCAGCTTCC 


GCCTCAACCA 


GTGCGTCGGC 


TTCAGCAAGC 


3060 
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ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT 
GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 
GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 
GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA 
GCATCAACCA GTGCGTCAGC CTCAGCATCG ACAAGCGCCT 
GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG GCCTCAACCA 
ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT 
GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGTCAACCA 
ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT 
GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA 
ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT 
GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCGTCAACCA 
ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 
GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 
ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 
GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACCA 
ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 
GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 
ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT 
CCTCAkCAAG CACATCAGCT TCTGAATCTG CATCAACCAG 
AAGCGCCTCG GCCTCAGCAA GTACAAGTGC TTCAGCCTCA 
TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG 
CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA 
TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT 
(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



CGGCATCAAC 
CAACCAGTGC 
CTGAATCGGC 
GCACATCAGC 
CAGCTTCAGC 
GTGCATCTGA 
CAGCATCAAC 
GTGCGTCAGC 
CAGCATCGAC 
GCGCCTCAGC 
CAGCAAGTAC 
GTGCATCAGA 
CAGCAAGCAC 
GTGCGTCAGC 
CAGCAAGTAC 
GTGCGTCTGA 
CAGCGTCAAC 
GCGCCTCAGC 
CAGCATCAAC 
TGCGTCACTT 
GCATCAACCA 
TGCCTCGGCT 
GCATCAACCA 
GCG 



GAGTGCGTCG 
GTCGGCTTCA 
ATCAACGAGT 
TTCTGAATCG 
AAGTACCAGT 
ATCGGCATCA 
GAGTGCATCG 
TTCAGCAAGT 
AAGTGCCTCG 
TTCAGCAAGT 
TAGTGCATCA 
GTCAGCAAGT 
CAGTGCGTCG 
CTCAGCAAGT 
TAGCGCCTCA 
ATCGGCATCA 
AAGTGCATCG 
CTCAGCGTCA 
GAGTGCGTCA 
CCGCATCAAC 
GTGCATCAGC 
TCAGCAAGTA 
GTGCTTCGCiC 



3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

aOtiO 

4140 

4200 

4260 

4320 

4360 

4440 

4483 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGTVATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCASCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 360 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACGAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACGAGTG CGTCAGCCTC AGCAAGTACT AGTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGGTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 1260 

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACCAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACGAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCGGCATCAA CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACGAG 1740 
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CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC 


TCAACCAGCG 


CCTCGGCCTC 


1800 


AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC 


TCAGCCTCAG 


CATCAACGAG 


1860 


TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA 


TCAACGAGTA 


CCTCAGCTTC 


1920 


AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG 


TCAGCCTCAG 


CAAGTATCTC 


1980 


AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 


TCAACGAGTA CGTCAGCCTC 


2040 


AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG 


TCAGCCTCAG 


CATCGACAAG 


2100 


CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 


TCGACAAGTG 


CGTCGGCCTC 


2160 


AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 


TCAGCAAGTA 


CTAGTGCATC 


2220 


GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC 


AGTGCGTCAG 


CTTCAGCAAG 


2280 


TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC 


TCAGCATCGA 


CAAGTGCCTC 


2340 


GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG 


CTTCAGCAAG 


2400 


TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT 


ACAGCAAGTA 


CTAGTGCATC 


2460 


AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC 


AGTGCATCAG 


AGTCAGCAAG 


2520 


TACCAGTGCG TCAGTTCACG CATCAACAAG 






2550 



(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 



ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 


TGCGTCGGCT 


TCAGCATCAA 


60 


CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 


GGCATCAACG 


AGTGCGTCAC 


120 


CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT 


GCATCGGCTT 


CAGCAAGTAC 


180 


CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG 


CAAGTACCAG 


CGCCTCAGCC 


240 


TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


300 


AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG 


CATCAACGAG 


TGCATCAGCT 


360 


TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG 


CGTCTGAATC 


GGCATCAACG 


420 


AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG 


CGTCAACAAG 


TGCTTCGGCT 


480 


TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA 


CGTCAGCCTC 


AGCAAGCACA 


540 


TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG 


CATCGACAAG 


CGCCTCAGCT 


600 
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TCAGCAAGTA CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC AGCGTCGACA 6 60 

AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA 720 

AGTACTAGCG CCTCAGCCTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGTGCA 780 

TCAGCTTCAG CAAGTACTAG CGCCTCAGCC TCAGCGTCGA CAAGCGCCTC AGCTTCAGCA 840 

AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG TACCTCAGCG 900 

TCTGAATCAG CATCAACAAG TGCGTCGGCT TCAGCATCAA CGAGTGCATC AGCTTCAGCA 960 

TCAACAAGTG CTTCAGCTTC AGCAAGTACC AGTGCGTCGG CTTCAGCATC AACGAGTGCT 1020 

TCAGTCTCAG CGTCAACCAG TGCCTCTGAA TCCGCATCAA CAAGTGCCTC GGCTTCAGCA 1080 

AGCACCAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTGCG 1140 

TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCTTCCGCA 1200 

TCAACAAGCG CCTCGGCCTC AGCAAGTACA AGTGCTTCAG CCTCAGCATC AACCAGTGCA 12 60 

TCAGCTTCAG CCTCAACAAG TGCTTCAGCC TCAGCGTCAA CCAGTGCGTC GGCTTCAGCA 1320 

AGTACCAGTG CGTCAGCTTC AGCAAGCACA AGTGCGTCAG CTTCAGCATC AACCAGTGCT 13 80 

TCGGCTTCGG CATCAACAAG TGCCTCAGCA TCAGCATCAA CCAGTGCGTC AGCCGG 1436 
(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 
GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT 
CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC 
AGGTCTTCCC GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG 
AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT 
AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT 
ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA 
GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC 
TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG 
ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG 



TTTACCCATG 
TGTATGGTTA 
TTTTTTGCGT 
TCCTCTTGGA 
GCTGTCATCA 
TTTGGTTCCT 
TTTTTGTCCA 
AATTGCTTTT 
TCCAGTCGTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 
TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 
GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 
AAGCCGGGGA TCCTC 

(2) INFORMATION FOR SEQ ID NO: 3 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 60 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 120 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 180 
TGAATCTGCA TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG ■ 240 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 300 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 360 
TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA ACTACTAv-TG CATC-.C^..^ 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTTCAG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 660 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCGCG TCTGAATCCG CATCAACGAG 720 

TGCGTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGGCT-C 780 

AGCAAGCACA AGTGCTTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 
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AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260 

CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 
TGAATCTGCA TCAACCAGTG CG 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCG 60 

GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAG TGCGTCACTT CAGCATCAAC 120 
CAGTGCTTCG GCTTCGGCAT CAACAAGTGC CTCAGCATCA GCATCAACGA GTGCGTCACC 
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 

TCAGCGTCTG AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG TGCATCGGCT 300 

TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 3 60 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 480 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 540 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 600 

AGCGCCTCAG CTTCAGCAAG CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 660 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 720 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 780 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 840 



180 
240 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 
TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 
AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGrTCC TATGAGCTTG GACGTTACCA 
AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 
TCAAAAGGCA GAAAACTTCA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 
ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 
TTACTATAAT GGCAAGGTTC CTTATCATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 
TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 
TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA GCTCATGCGG ATAATATTCG 
GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 
AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 
CTTCAATGCA TCTGATATCA TTCAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 
CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGOTAGC GAGTTAGCTG CTGCAGAAGC 
CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTTATA ATGCAAATCC 
AGCTCAACCA AGArTGTCAG AGAACCACAA TCK5ACTGTC ACTCCAACTT ATCATCAAAA 
TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 
CATGTGGGAT CTG 

(2) I^3FORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 
GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 
rrTGACACGT TCTGCAGAT^ GACCCAGGAT GACCATC^TC TTGAGTCCAG TAATGTCTGG 
CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 
GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 
TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 
GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
973 



60 
120 
180 
240 
300 
360 
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420 

480 

540 

600 

660 

720 

•780 

840 

843 
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(D) TOPOLOGY: linear 



SEQUENCE DESCRIPTION: SEQ ID NO: 357. ^ ^^^^^^^^g 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
807 
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(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 653 base pairs 
(B) TYPE: nucleic acid 
(CI STRANDEDNESS: double 
(D) TOPOLOGY: linear 



600 

553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGGCTCTC €0 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 3 00 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 360 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 480 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 
ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 
GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 
(2) INPORKATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT GACCAGTCAG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 300 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 360 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 



wo 98/18931 



PCT/US97/19588 



1373 

TCAACCAAGT AGGAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 
CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTCCATGATA 
.GGTCGATAC TGTCAGCACA GTGArTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 
CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 
CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 
nrcCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 
^.GCOAAATTA TTGCTTTTAA ^TCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 
TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 

TGGCAATATC AACACGATTC TTATCATCGG TTCCTA<iAU>. ^x««oov - 

AT^TCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATC^TT TGGCA^A 
^TCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA AITGGAACAA 
GCCArrATAG ACTATATTCA TTACTACAAC AACAAACGAA rTAAGGTAAA ACTAAAAGGA 
CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTOGATAAA TTAATTGTCT AACTTT^G 
CGTCAGTACA AAACTCTTGC TACTATGCGT ^ATTATTG AAAGACTTAT TGGACTTTCT 
CTCAAATCGA GTrmACTC AATTrTCTTA CTTGArrcCG ATTGAAATTC CAA^TAATTT 
CTCTGAGTAG AGTGTCTTGA TATTGGCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 
CAAGAAAAAT TCTTGAATGG TTTCGATTTC AOGCTCACGA ATAGCACGGT GTTTGTTTGA 
GATGAGGATT TCATAGTGAA GCGGAGCrTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 
ATAAACCTCA ACAAGGCTTG CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 
TGAGTTTGTC CTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 
CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 
GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGl^ GAATTTTATT 
TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 
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CTACCACGTT TAAGGTTTTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT ATGATTAAAG 
TTTTCTAGTG GCTTTI^TAC TTCTTTGAAA GGAGTTGCTT CATAGAGGTA GGCAGGATTG 
TAGTAGTAGG TATCACGATG ACGAGAATAG AAATATTCGT CAGCTTGTCC GTAATAGGTA 
CCAAm-CTG CTGTGAAACC AAGCTCTTCA ATCAACTCAT GCTTTAGGGC. TTCCTGATGA 
^CACCTG CTTCAATTTC TCCACATGGT AGGAACCAAG CACCA-™ TTCTTGAACA 
AGAACAATTT GT™TTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 
TAGTCTGTAT TCACTTTTrT TCTCCGAAAG TTGGGTrTGC CATTGCAT^T TCCTCATTAT 
CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 
TCrTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTOTC TCTTArTG^ AGCGAACAGG 
TAGTATAACA GAAGCATCAC ACGTTrTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 
AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 
GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTGACTGA 
AATAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGC^A AAGCTATGGG 
tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 
(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 
CTTAGAGAAG CTTTCAAATA AATTTGGGAT A»ACAATTCT AATCTTAGGT ATATGATTAA 
ATTGATTGAT CGTTACGGAA TAGAGl^CGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 
TGATTTAAAA CAAGAAATGA ^TAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 
TTCTCrn^ TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 
GAAAAACGGG TATACTATTG T^^AAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 
CCATCCTAAA AAAOTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 
.GAAATTCTT TAAGAATTAA ^ACTGA^TT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 
ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 
AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGC^ATCG 
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TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660 

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 
GCTTTACTTA T 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GGGTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 

GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA 3 00 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 3 50 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAAGACTTGT 420 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGTTTTTTT 4 80 

CTACACTTAT ACCATAAAGG GGAAACTCTT TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 900 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 960 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 1020 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 1080 
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ATCTTCTTCC AGATACrTTG CCTCrTCTTA ACTG^CCTTT TAATGAGCGA CCATATTCTC U40 

1168 

GATAAAAATA AGTATCGAAT CCTGTTTC 
(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATC«3CATCA ACCAGTGCGT CCGCTTCAGC 
GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 
GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 
TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 
GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 
ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 
CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 
ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 
^.TCGGCl^CA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 
AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 
CTCGGCTTCA GCAAGCACAT CAGCATC«5A ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 
;^GTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 
ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 
AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 
GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 
AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 
CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 
ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 
ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAtJT ACTAGCGCCT CAGCCTCAGC 
GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 
G^CCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 
ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 
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TCrGCTTCAG 


CAAGTACCAG 


TGCGTCAGCC 


TCAGCAAGTA 


CCAGTGCJcTC 


AGCCTCAGCG 


1380 


TCGACAAGTG 


CGTCGGCCTC 


AACCAGTGCA 


TCTGAATCGG 


CATCAACCAG 


TGCGTCAGCC 


1440 


TCAGCAAGTA 


CTAGCGCCTC 


AGCCTCAGCA 


TCAACGAGTG 


CGTCCGCTTC 


AGCAAGTACT 


1500 


AGTGCATCAG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCCTCAG 


CGTCGACAAG 


CGCCTCAGCT 


1550 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCG 


TCGACAAGTG 


CGTCGGCTTC 


AGCAAGTACC 


1620 


TCAGCGTCTG 


AATCAGCATC 


AACAAGTGCG 


TCGGCTTCAG 


CATCAACGAG 


TGCATCAGCT 


1680 


TCAGCATCAA 


CAAGTGCTTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


1740 


AGTGCTTCAG 


TCTCAGCGTC 


AACCAGTGCC 


TCTGAATCGG 


CATCAACAAG 


TGCCTCGGCT 


1800 


TCAGCAAGCA 


CCAGTGCTTC 


GGCTTCAGCG 


TCAACGAGTG 


CGTCTGAGTC 


AGCATCAACG 


1660 


AGTGCGTCAC 


CTCAGCAAGC 


ACATCAGCTT 


CTGAATCTGC 


ATCAACCAGT 


GCGTCACTTC 


1920 


CGCATCAACA 


AGCGCCTCGG 


CCTCAGCAAG 


TACAAGTGCT 


TCAGCCTCAG 


CATCAACCAG 


1980 


TGCATCAGCT 


TCAGCCTCAA 


CAAGTGCTTC 


AGCCTCAGCG 


TCAACGAGTG 


CGTCGGCTTC 


2040 


AGCAAGTACC 


AGTGCGTCAG 


CTTCAGCAAG 


CACAAGTGCG 


TCAGCTTCAG 


CATCAACCAG 


2100 


TGCTTCGGCT 


TCGGCATCAA 


CAAGTGCCTC 


AGCATCAGCA 


TCAACGAGTG 


CGTCASCTCA 


2160 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACCAGTGCAT 


CAGCCTCAGC 


T^GTATCTCA 


2220 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCATCA 


GCATCAGCAT 


CAACGAGTGC 


ATCGGCTTCA 


2280 


GCGTCAACCA 


GTGCATCAGT 


CTCAGCAAGC 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACCAGT 


2340 


GCCTCAGCCT 


CAGCAAGTAT 


CTCAGCGTCT 


GAATCGGCAT 


CAACGAGTGC 


GTCAGcCTCA 


2400 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACGAGTGCAT 


CGGCTTCAGC 


AAGTACCAGC 


2460 


GCCTCAGCTT 


CAGCAAGCAC 


CAGTGCGTCA 


GCCTCAGCAA 


GTACCAGCGC 


GTCAGCCTCA 


2520 


GCAAGCACCA 


GTGCCTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CAGCCTCAGC 


GTCGACAAGT 


2580 


GCGTCGGCTT 


CAGCAAGTAC 


CTCAGCGTCT 


GAATCAGCAT 


CAACGAGTGC 


ATCAGCTTCA 


2640 


GCATCTU^CAA 


GTGCTTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACCAGT 


2700 


GCTTCAGTCT 


CAGCGTCAAC 


CAGTGCCTCT 


GAATCAGCAT 


CAACAAGTGC 


CTCGGCTTCA 


2760 


GCAAGCACCA 


GTGCGTCGGC 


TTCAGCAAGT 


ACTAGTGCAT 


CGGCTTCAGC 


ATCGACAAGT 


2820 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCAGCCTCA 


2880 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CGGCTTCAGC 


GTCAACCAGT 


2940 


GCGTCGGCTT 


CAGCGTCGAC 


AAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCGGCCTCA 


3000 


GCAAGCGCAA 


GTACCTCAGC 


GTCAGCTTCC 


GCCTCAACCA 


GTGCGTCGGC 


TTCAGCAAGC 


3060 
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ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT 
GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 
GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 
GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA 
GCATCAACCA GTGCGTCAGC CTCAGCATCG ACAAGCGCCT 
GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG GCCTCAACCA 
ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT 
GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGTCAACCA 
ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT 
GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA 
ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT 
GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCGTCAACCA 
ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 
GCTTCAGCAA GTACTAGCGC CTCAGCGTCA GCCTCAACCA 
ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 
GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACCA 
ACCAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 
GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 
ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT 
CCTCAkCAAG CACATCAGCT TCTGAATCTG CATCAACCAG 
AAGCGCCTCG GCCTCAGCAA GTACAAGTGC TTCAGCCTCA 
TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG 
CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA 
TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT 
(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



CGGCATCAAC 
CAACCAGTGC 
CTGAATCGGC 
GCACATCAGC 
CAGCTTCAGC 
GTGCATCTGA 
CAGCATCAAC 
GTGCGTCAGC 
CAGCATCGAC 
GCGCCTCAGC 
CAGCAAGTAC 
GTGCATCAGA 
CAGCAAGCAC 
GTGCGTCAGC 
CAGCAAGTAC 
GTGCGTCTGA 
CAGCGTCAAC 
GCGCCTCAGC 
CAGCATCAAC 
TGCGTCACTT 
GCATCAACCA 
TGCCTCGGCT 
GCATCAACCA 
GCG 



GAGTGCGTCG 
GTCGGCTTCA 
ATCAACGAGT 
TTCTGAATCG 
AAGTACCAGT 
ATCGGCATCA 
GAGTGCATCG 
TTCAGCAAGT 
AAGTGCCTCG 
TTCAGCAAGT 
TAGTGCATCA 
GTCAGCAAGT 
CAGTGCGTCG 
CTCAGCAAGT 
TAGCGCCTCA 
ATCGGCATCA 
AAGTGCATCG 
CTCAGCGTCA 
GAGTGCGTCA 
CCGCATCAAC 
GTGCATCAGC 
TCAGCAAGTA 
GTGCTTCGGC 



3120 
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3480 

3540 
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GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 3 60 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACGAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT AGTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGCTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 1260 

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACCAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCGGCATCAA CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACGAG 1740 
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CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC TCAACCAGCG CCTCGGCCTC 
AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 
TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 
AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 
AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC 2040 
AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG TCAGCCTCAG CATCGACAAG 2100 
CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 
AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 2220 
GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 
TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2340 
GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 
TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAAGTA CTAGTGCATC 2460 
AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2520 
TACCAGTGCG TCAGTTCACG CATCAACAAG 
(2) INFORMATION FOR SEQ ID NO: 365: 
( i ) SEQUENCE CHARACTERISTICS : 

fill T KTlT^fTTU . ^A'7C n^i.^'E 

i <-v ; ui^vva^i*. t — 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2550 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 
ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA 
CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG 
CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 
CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 
TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 
AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 
TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420 
AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480 
TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 
TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 
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TCAGCAAGTA CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC AGCGTCGACA 660 

AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA 720 

AGTACTAGCG CCTCAGCCTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGTGCA 780 

TCAGCTTCAG CAAGTACTAG CGCCTCAGCC TCAGCGTCGA CAAGCGCCTC AGCTTCAGCA 840 

AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG TACCTCAGCG 900 

TCTGAATCAG CATCAACAAG TGCGTCGGCT TCAGCATCAA CGAGTGCATC AGCTTCAGCA 9 60 

TCAACAAGTG CTTCAGCTTC AGCAAGTACC AGTGCGTCGG CTTCAGCATC AACGAGTGCT 1020 

TCAGTCTCAG CGTCAACCAG TGCCTCTGAA TCCGCATCAA CAAGTGCCTC GGCTTCAGCA 1080 

AGCACCAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTGCG 1140 

TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCTTCCGCA 1200 

TCAACAAGCG CCTCGGCCTC AGCAAGTACA AGTGCTTCAG CCTCAGCATC AACCAGTGCA 1260 

TCAGCTTCAG CCTCAACAAG TGCTTCAGCC TCAGCGTCAA CCAGTGCCTC GGCTTCAGCA 1320 

AGTACCAGTG CGTCAGCTTC AGCAAGCACA AGTGCGTCAG CTTCAGCATC AACCAGTGCT 1380 

TCGGCTTCGG CATCAACAAG TGCCTCAGCA TCAGCATCAA CCAGTGCGTC AGCCGG 143 6 
(2) INFORMATION FOR SEQ ID NO : 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 66: 

GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT TTTACCCATG 60 

CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC TGTATGGTTA 120 

AGGTCTTCCC GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG TTTTTTGCGT 180 

AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT TCCTCTTGGA 240 

AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT GCTGTCATCA 300 

ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA TTTGGTTCCT 360 

GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC TTTTTGTCCA 420 

TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG AATTGCTTTT 480 

ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG TCCAGTCGTA 540 
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TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 
TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 
GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 
AAGCCGGGGA TCCTC 

(2) INFORMATION FOR SEQ ID NO: 3 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 60 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 120 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 180 

TGAATCTGCA TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG 240 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 300 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 360 

TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA AGTACTAGTG CATCAGCTTC 420 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCGGCATCAA CAAGTGCCTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTTCAG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 6 60 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCGCG TCTGAATCCG CATCAACCAG 720 

TGCCTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGGCT-C 780 

AGCAAGCACA AGTGCTTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 1140 
TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 



1200 
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AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 
CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 
AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 
TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 
AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 
AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 
GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 
AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 
TGAATCTGCA TCAACCAGTG CG 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1702 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



ACCAGTGCAT 


CAGCTTCAGC 


CTCAACAAGT 


GCTTCAGCCT 


CAGCGTCAAC 


CAGTGCCTCG 


60 


GCTTCAGCAA 


GTACCAGTGC 


GTCACTTCAG 


CAAGCACAAG 


TGCGTCACTT 


CAGCATCAAC 


120 


CAGTGCTTCG 


GCTTCGGCAT 


CAACAAGTGC 


CTCAGCATCA 


GCATCAACGA 


GTGCGTCACC 


180 


TCAGCAAGTA 


CTAGTGCATC 


AGCATCAGCA 


TCAACCAGTG 


CATCAGCCTC 


AGCAAGTATC 


240 


TCAGCGTCTG 


AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG 


TGCATCGGCT 


300 


TCAGCGTCAA 


CCAGTGCATC 


AGTCTCAGCA 


AGCACCAGTG 


CGTCGGCTTC AGCATCAACG 


360 


AGTGCCTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


TGCGTCAGCC 


420 


TCAGCAAGTA 


CTAGTGCATC 


GGCTTCAGCA 


AGCACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


480 


AGTGCCTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


TGCGTCAGCC 


540 


TCAGCAAGTA 


CTAGTGCATC 


AGCATCAGCA 


TCAACGAGTG 


CATCGGCTTC 


AGCAAGTACC 


600 


AGCGCCTCAG 


CTTCAGCAAG 


CACCAGTGCG 


TCAGCCTCAG 


CAAGTACCAG 


CGCCTCAGCC 


660 


TCAGCAAGCA 


CCAGTGCCTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


720 


AGTGCGTCGG 


CTTCAGCAAG 


TACCTCAGCG 


TCTGAATCAG 


CATCAACGAG 


TGCATCAGCT 


7 80 


TCAGCATCAA 


CAAGTGCTTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


840 
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AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900 
TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 941 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420 

GCGCCTCAGC CTCAGCGTCA ACAAGTGCAT CGGC'I'TCAGC GTUAAcGaGT uCGTC-rGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720 

CAACGAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780 

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC GAGTGCGTCA GnTTCCGCAT 840 
GCAACAAGTG CCTCGGCTTC AGCAAGTAC 
(2) INFORMATION FOR SEQ ID NO: 370: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



869 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 



60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 
TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 
TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 
AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 
TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 
AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 
TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 
AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 
TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 
AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 
TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC GGCTTCAGCA 
AGTACTAGCG CCTCAGCATC AGCGTCAACG 
(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE; nucleic acid 

( C ) STRAI'JDEDKES 3 : uuub It: 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 
CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGC 
GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 
CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 
CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 
TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 
GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 
TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 
ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 
TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 
TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 
AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 
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AGCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGAGTTGG TGACAGAGTC 720 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATGTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

CAAAGCGCCT CAGCTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60 

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120 

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180 

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240 

al\;aGl:gc(JT cuui^L-rCAOC aauCACCtCA GCTTC'l'GAAT CGGCCTCAAC CaGCGCCtCC> 300 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 360 

ACGAGTGCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420 

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 480 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 540 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600 

ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCTTCAGCGT CAGCGTCGAC AAGTGCGTCG 660 

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTAGT 720 

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA . 780 

GCAAGTACCA TGTGCTTCAT GTCTCAG 807 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373; 



CATCGGCTTC 


AGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTACCGCC 


TCAGCCTCAG 


60 


CGTCAACAAG 


TGCATCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTG 


120 


CGTCACCTCA 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CACCTCAGCA 


180 


TCGACAAGCG 


CCTCAGCTTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


ACAAGTGCGT 


240 


CGGCTTCAGC 


AAGTACCAGT 


GCGTCAsCTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


300 


ACAAGTGCGT 


CGGCCTCAAC 


CAGTGCATCT 


GAATCGGCAT 


CAACCAGTGC 


GTCACCTCAG 


360 


CAAGTACTAG 


TGCATCAGCT 


TCAGCATCAA 


CGAGTGCATC 


GGCTTCAGCA 


TCAACCAGTG 


420 


CATCAGAGTC 


AGCAAGTACC 


AGTGCGTCAG 


cTTCCGCATC 


AACAAGTGCC 


TCGGCTTCAG 


480 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCAA 


CAAGTGCTTC 


AGCTTCCGCG 


TCAACCAGCG 


540 


CCTCGGCCTC 


AGCAAGTATC 


TCAGCGTCTG 


AATCGGCATC 


AACAAGTGCC 


TCGGCTTCAG 


600 


CATCAACGAG 


TGCATCAGTC 


TCAGCAAGCA 


CCAGTGCGTC 


GGCCTCAGCA 


AGCACCAGCG 


660 


CGTCTGAATC 


CGCATCAACC 


AGTGCCTCAG 


CTTCAGCAAG 


TACCTCAGCA 


TCTGAATCAG 


720 


CATCAACAAG 


TGCATCGGCT 


TCAGCAAGCA 


CAAGTGCTTC 


AGCCTCAGCA 


AGTATCTCAG 


780 


CGTCTGAATC 


GGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCATCAG 


840 


CGTCAACAAG 


TGCTTCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTA 


900 


CGTCAGCCTC 


AGCAAGCACA 


TCAGCTTCTG 


AATCTGCATC 


AACCAGTGCG 


TCAGCCTCAG 


960 


CATCGACAAG 


CGCCTCAGCT 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCA 


AGTACCAGTG 


1020 


CTTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


GCTCAACCAG 


TGCATCTG 




1068 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACCAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 
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CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420' 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA ^20 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



GTATTGGGGC 


GCCCCAACCT 


CTATGTGACT 


ACGGATTATT 


TCCTAGATTA CATGgGGATA 


60 


AACCATTTAG 


AAGAATTACC 


AGTGATTGAT 


GAGCTTGAGA 


TTCAAGCCCA 


AGAAAGCCAA 


120 


TTATTTGGTG 


AAAGGATAGA 


AGAAGATGAG 


AATCAATAAG 


TATATTGCCC 


ACGCAGGTGT 


180 


GGCCAGTAGG 


AGAAAAGCAG 


AAGAGCTGAT 


TAAGCAAGGC 


TTGGTGACGG 


TTAACGGCCA 


240 


AGTGGTGCGT 


GAACTAGCAA 


CCACTATCAA 


GTCAGGCGAC 


AAGGTCGAAG 


TTGAAGGTCA 


300 


ACCTATCTAC 


AACGAAGAAA 


AGGTCTACTA 


TCTGCTTAAC 


AAACCACGCG 


GTGTGATTTC 


360 


CAGTGTGACA 


GATGATAAGG 


GTCGCAAGAC 


GGTTGTCGAC 


CTCTTGCCCA 


ATGTCAAAGA 


420 


GCGTATTTAC 


CCTGTGGGTC 


GTTTGGACTG 


GGATACATCA 


GGTGTCTTGA 


TTTTGACCAA 


480 


TGATGGGGAC 


TTTACAGACG 


AGATGATTCA 


CCCTCGTAAT 


GAGATTGACA 


AGGTTTATGT 


540 


CGCGCGTGTT 


AAAGGTGTGG 


CCAATAAGGA 


CAATCTCCGC 


CCCTTGACCC 


GTGGTCTTGA 


600 


GATTGATGGT 


AAGAAAACCA 


AGCCATAATA 


TATAGGTTTT 


GTAGCCTCTA 


CACCATAAAT 


660 


ATTTGCTAAT 


AAAAATACTG 


TATTATTACC 


CTCTTAAGGT 


GCGAAATTAT 


TCAAGTTCTT 


720 



(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 480 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAAGTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTGCGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAGCGTCA ACCAGTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 240 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 300 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 360 

CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 420 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 480 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 540 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 
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CAGCTTCAGC ATCAACGAGT GCATCGGCTT 
(2) INFORMATION FOR SEQ ID NO: 378: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 
120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 
ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 
CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT . GAGTGGGATA AAAGTTTCAT 180 
TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 240 
CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 300 
GAGCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 3 60 

CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 
TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 
TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 
AAGAGTTTCC CCTTTATGGT ATAAGTGTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 
TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 
GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 
AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 
ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 840 
ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 900 
AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 960 
ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 
(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



420 
480 
540 
600 
660 
720 
780 



1003 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 
TGAAATACTA ATTGTTTTAG CTATTATCCT ATCTCTTGCT TTGATTGTAT TGGTAACTAT 
ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 
CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 
TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 
CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 
TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTGACGAGc 
GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 
TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 
CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 
ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 
CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 
ATGAAGGTTT TTTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SBQUFNCE CHJU5ACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 
GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 
TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 
TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 
TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 
GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 
CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 
GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 
CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 
TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGCCATAAGG GCAAGAGTTC 



180 
240 
300 
360 



480 
540 
600 
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CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 3 60 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

T'T-TTCCCAGT TGAAAC-TAAA AGAGCGGATu" TCT'AAGtATt TAGTAACGAA TGACCmACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC a 691 

(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 
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TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 300 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 360 

CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 420 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 4 80 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 600 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 720 



GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 
(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAATTCTTTG 240 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAGACTCTA CCATGTTGAT AGOGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 3 60 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 4 80 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 
(2) INFORMATION FOR SEQ ID NO: 384: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 24 0 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 480 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 65 V 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
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TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540 
AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
tD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

CGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTCAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 420 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 60 

CGAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 
GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC . 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 420 
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ACAAG 

(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 88: 
AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTTCTTTT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 3 60 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGkCTCCA AAAAAJcAGCT 54 U 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: do\ible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 
CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 
CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 
CAAcAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 



60 
120 



240 
300 



CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 
CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 
CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 360 
GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 480 
GCGCCTCAGC TTCAGCAAGC ACCGG 505 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 



GCTAAGACTA 


CCTCATTAGG 


GGCATAGGCT 


GCTAAAATAA 


CTGCAGCTGT 


GGTTAATGAC 


60 


AATACTGTAC 


TTTTTTTCAT 


TTTAATTCCT 


TACATATTTA 


TATAACTTCC 


AATAGATAAT 


120 


AAACTTTAAC 


TTTGCTAGCC 


TTTGTTATAA 


AAAGTTTTAC 


TAAGTATTAT 


CTAGGAAATA 


180 


GAGTAGTACA 


TTTATATATA 


ATTGTTATCT 


CTCTATAAAA 


ACAGTATATC 


ATTTAAAAAA 


240 


ATTTAAGTCA 


AAAAAATTAA 


CATTAGTTAA 


TTTATTTTTT 


AGCACACATT 


AAAAAATAAG 


300 


ATTAGTACTC 


AATGAAAATC 


AAAGAGCAAA 


CTAGGAAACT 


AGCCGCAGAT 


TGCTCAAAAC 


360 


AGTGTTTTGA 


GGTTGTAGAT 


GGAATGACGT 


AGTCAGCTCA 


AAACACTGTT 


TTGAAGTTGT 


420 


GGATAGAACT 


GACGAAGTCG 


GTACCGA 








447 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 



AGCACTTGTC 


GTTGAATTCT 


ACAACAAAAT 


GTTGTAATAT 


TTTATTGAAT 


AAGATAGGCC 


60 


TTGATATTAA GCACTTTGGG 


ACGTTCTCCC 


TTAGTGCTTT 


TTTGATTTCT 


CTTAGTATCC 


120 


AGCTATAATC 


GTTGAGACAT 


AACTAGACCG 


ATATAGTCCA 


AAGTGATATA 


GTAAAATGAA 


180 


CCAAAAATAG 


TACACAATGT 


GGTATAATCC 


TTTTATGGCA 


TATTCAATAG 


ATTTTCGTAA 


240 


AAAAGTTCTC 


TCTTATTGTG 


AGCGAACAGG 


TAGTATAACA 


GAAGCATCAC 


ACGTTTTCCA 


300 


AATCTCACGT 


AATACCATTT 


ATGGCTGGTT 


AAAGCTAAAA 


GAGAAAACAG 


GAGAGCTAAA 


360 


CCACCAAGTA 


TAGTGTATTG 


AATCTATAAC 


AGTACACCTT 


GGCTGCTAAA 


ATATTTCTAT 


420 
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AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATAT/U^GT 4 80 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 
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MISSING UPON TIME OF PUBLICATION 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 

effected to an expert m the aft. The request to this effect shall be filed by the applicant with the 
Intemational Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been filed by the applicant, any request 
made by a third party for the fUrnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The. applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
Intemational Bureau before the completion of the technical preparations for the Intemational 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1 F( 1 ) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever two dates occiirs earlier. 
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SINGAPORE 



The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for mtemational 
publication of the application. 



NORWAY 



The appUcant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the apphcant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the fiimishmg of a s^ple shall 
indicate the expert to be used. That expert may be any person entered on a list of recogmzed 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant m the 



AUSTRALIA 

The applicant hereby gives notice that the fiimishing of a sample of a microorganism sh^ only 
be effected prior to tiie grant of a patent, or prior to the lapsing, refiisal or withdrawal of the 
application, to a person who is a skilled addressee without an interest m the mvention 
(Regulation 3.25(3) of the Australian Patents Regulations). 



FINLAND 



The applicant hereby requests that, until the application has been laid open to mspection 
(by the National Board of Patents and Registration), or has been fmally decided upon by the 
National Board of Patents and Registration without having been laid open to public mspection, 
the furnishing of a sample shall only be effected to an expert m the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to ^^^^^^^'"^^^ 
(by the Icelandic Patent Office), or has been finally decided upon by the -^^J^^^ °f " 
without having been laid open to public inspection, the fumishmg of a sample shall only be 
effected in the art. 
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What Is Claimed Is: 

1 . Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS:l-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identify ing fragments of the Streptococcus 
pneumoniae genome of conunercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a repi^entative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-39 1, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
degenerate variaut thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5* to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 

95 to any one of the fragments of the Streptococcus pneumoniae genome depicted in 

SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ED NOS: 1-391 and 

100 Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 

comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS:l-391 and Tables 2 and 3, including fragments 
thereof; 

105 b) identifying members of said library which contain sequences that 

hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 

no 

16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ED NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

1 15 a) isolating mRNA, DNA, or cDN A produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

120 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 



125 



18. An isolated polynucleotide molecule encoding any one of the 
polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 
pn^M/Mowiae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under 
conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 
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AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900 

TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 9^1 

(2) INFORMATION FOR SEQ ID NO: 3 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA . 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCGTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACCA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420 

GCC5CCn'CAG0 C-rCAGCGTCA ACAAGTGCAT CGGCTTCAGC GTCAACGAGT GCGTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720 

CAACGAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780 

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC GAGTGCGTCA GnTTCCGCAT 840 
GCAACAAGTG CCTCGGCTTC AGCAAGTAC 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 



60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 24 0 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 3 60 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 480 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 54 0 

TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 
(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STKAI^DEDNESS : uOUul€: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 



CCGGAAAACA 


GCTCTGGCGC 


TTGGTCTTGC 


CCAGCGTATT 


GCTAGTGGTG 


ACGTGCCTGC 


60 


GGAAATGGCT 


AAGATGCGCG 


TGTTAGAACT 


TGATTTGATG 


AATGTCGTTG 


CAGGGACACG 


120 


CTTCCGTGGT 


GACTTTGAAG 


AACGCATGAA 


TAATATCATC 


AAGGATATTG 


AAGAAGATGG 


180 


CCAAGTCATC 


CTCTTTATCG 


ATGAACTCCA 


CACCATCATG 


GGTTCTGGTA 


GCGGGATTGA 


240 


TTCGACTCTG 


GATGCGGCCA 


ATATCTTGAA 


ACCAGCCTTG 


GCGCGTGGAA 


CTTTGAGAAC 


300 


GGTTGGTGCC 


ACTACTCAGG 


AAGAATATCA 


AAAACATATC 


GAAAAAGATG 


CGGCACTTTC 


360 


TCGTCGTTTC 


GCTAAAGTGA 


CGATTGAAGA 


ACCAAGTGTG 


GCAGATAGTA 


TGACTATTTT 


420 


ACAAGGTTTG 


AAGGCGACTT 


ATGAGAAACA 


TCACCGTGTA 


CAAATCACAG 


ATGAAGCGGT 


480 


TGAAACAGCG 


GTTAAGATGG 


CTCATCGTTA 


TTTAACCAGT 


CGTCACTTGC 


CAGACTCTGC 


540 


TATCGATCTC 


TTGGATGAGG 


CGGCAGCAAC 


AGTGCAAAAT 


AAGGCAAAGC 


ATGTAAAAGC 


600 


AGACGATTCA 


GATTTGAGTC 


CAGCTGACAA 


GGCCCTGATG 


GATGGCAAGT 


GGAAACAGGC 


660 
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AGCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 720 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATGTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 



(2) INFORMATION FOR SEQ ID NO: 372: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 
CAAAGCGCCT CAGCTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60 

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120 

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180 

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240 

ACCAGCGCCT CGGCCTCAGC AAGCACCTCA GCTTCTGAAT CGGCCTCAAC CAGCGCCTCG 3 CO 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 360 

ACGAGTGCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420 

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 480 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 540 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600 

ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCTTCAGCGT CAGCGTCGAC AAGTGCGTCG 660 

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTAGT 720 

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA 780 

GCAAGTACCA TGTGCTTCAT GTCTCAG 807 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

CATCGGCTTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTACCGCC TCAGCCTCAG 60 

CGTCAACAAG TGCATCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTG 120 

CGTCACCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CACCTCAGCA 180 

TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG ACAAGTGCGT 240 

CGGCTTCAGC AAGTACCAGT GCGTCAsCTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG 300 

ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT CAACCAGTGC GTCACCTCAG 3 60 

CAAGTACTAG TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCAGCA TCAACGAGTG 420 

CATCAGAGTC AGCAAGTACC AGTGCGTCAG cTTCCGCATC AACAAGTGCC TCGGCTTCAG 480 

CAAGTACTAG CGCCTCAGCC TCAGCGTCAA CAAGTGCTTC AGCTTCCGCG TCAACCAGCG 540 

CCTCGGCCTC AGCAAGTATC TCAGCGTCTG AATCGGCATC AACAAGTGCC TCGGCTTCAG 600 

CATCAACGAG TGCATCAGTC TCAGCAAGCA CCAGTGCGTC GGCCTCAGCA AGCACCAGCG 660 

CGTCTGAATC CGCATCAACC AGTGCCTCAG CTTCAGCAAG TACCTCAGCA TCTGAATCAG 720 

CATCAACAAG TGCATCGGCT TCAGCAAGCA CAAGTGCTTC AGCCTCAGCA AGTATCTCAG 780 

CGTCTGAATC GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG 840 

CGTCAACAAG TGCTTCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA 900 

CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG 960 

CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG 1020 

CTTCAGCCTC AGCGTCGACA AGTGCGTCGG GCTCAACCAG TGCATCTG 1068 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 



wo 98/18931 



PCT/US97/19588 



1388 

CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA ^20 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATGgGGATA 


60 


AACCATTTAG 


AAGAATTACC 


AGTGATTGAT 


GAGCTTGAGA 


TTCAAGCCCA 


AGAAAGCCAA 


120 


TTATTTGGTG 


AAAGGATAGA 


AGAAGATGAG 


AATCAATAAG 


TATATTGCCC 


ACGCAGGTGT 


ibO 


GGCCAGTAGG 


AGAAAAGCAG 


AAGAGCTGAT 


TAAGCAAGGC 


TTGGTGACGG 


TTAACGGCCA 


240 


AGTGGTGCGT 


GAACTAGCAA 


CCACTATCAA 


GTCAGGCGAC 


AAGGTCGAAG 


TTGAAGGTCA 


300 


ACCTATCTAC 


AACGAAGAAA 


AGGTCTACTA 


TCTGCTTAAC 


AAACCACGCG 


GTGTGATTTC 


360 


CAGTGTGACA 


GATGATAAGG 


GTCGCAAGAC 


GGTTGTCGAC 


CTCTTGCCCA 


ATGTCAAAGA 


420 


GCGTATTTAC 


CCTGTGGGTC 


GTTTGGACTG 


GGATACATCA 


GGTGTCTTGA 


TTTTGACCAA 


480 


TGATGGGGAC 


TTTACAGACG 


AGATGATTCA 


CCCTCGTAAT 


GAGATTGACA 


AGGTTTATGT 


540 


CGCGCGTGTT 


AAAGGTGTGG 


CCAATAAGGA 


CAATCTCCGC 


CCCTTGACCC 


GTGGTCTTGA 


600 


GATTGATGGT 


AAGAAAACCA 


AGCCATAATA 


TATAGGTTTT 


GTAGCCTCTA 


CACCATAAAT 


660 


ATTTGCTAAT 


AAAAATACTG 


TATTATTACC 


CTCTTAAGGT 


GCGAAATTAT 


TCAAGTTCTT 


720 


(2) INFORMATION FOR SEQ ID NO: 376: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 480 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAAGTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTGCGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAGCGTCA ACCAGTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 240 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 300 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 360 

CCGCCTCAAC GAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 420 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 480 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 540 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 
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CAGCTTCAGC ATCAACGAGT GCATCGGCTT 
(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 
ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 
CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT GAGTGGGATA AAAGTTTCAT 
TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 
CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 300 
GAGCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 3 60 

CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 420 
TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 4 80 

TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 
AAGAGTTTCC CCTTTATGGT ATAAGTGTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 
TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 660 
GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 720 
AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 780 
ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 840 
ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 900 
AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 960 
ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 1003 
(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 
120 
180 
240 



600 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 



wo 98/18931 



PCT/US97/19588 



60 
120 
180 
240 
300 
360 



660 
720 
738 
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CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 
TGAAATACTA ATTGTTTTAG CTATTATCCT ATCTCTTGCT TTGATTGTAT TGGTAACTAT 
ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 
CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 
TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 
CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 
TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTGACGAGc 420 
GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 480 
TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 540 
CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 
ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 
CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 
ATGAAGGTTT TTTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 360 

CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 480 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGCCATAAGG GCAAGAGTTC 600 
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CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 



(2) INFORMATION FOR SEQ ID NO: 381: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 3 60 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

"TTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAiiCGAA Tox-»v.^j-ij-i^o^ -mw 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 

{2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 
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TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 



300 



ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 



360 



CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 



420 



CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 



480 



TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 



540 



AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 



600 



CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 



660 



TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 



720 



GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 



750 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAATTCTTTG 2 40 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 3 60 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 4 80 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 
(2) INFORMATION FOR SEQ ID NO: 384: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 24 0 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 480 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 
CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 
•l-rAAATCJG'i'A G'I'CAaOjCAAT T*TCaGAAtGT GT"TCTTCATC axCCACTAAT AAGACTT 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



600 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 



CCGCATCAGC 


ATCAACGAGT 


GCATCGGCTT 


CACGTCAACC 


AGTGCATCAG 


TCTCAGCAAG 


60 


CACCAGTGCG 


TCGGCTTCAG 


CATCAACGAG 


TGCCTCAGCC 


TCAGCAAGTA 


TCTCAGCGTC 


120 


TGAATCGGCA 


TCAACGAGTG 


CGTCAGCTCA 


GCAAGTACTA 


GTGCATCGGC 


TTCAGCAAGC 


180 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACGAGT 


GCCTCAGCCT 


CAGCAAGTAT 


CTCAGCGTCT 


240 


GAATCGGCAT 


CAACGAGTGC 


GTCACCTCAG 


CAAGTACTAG 


TGCATCAGCA 


TCAGCATCAA 


300 


CGAGTGCATC 


GGCTTCAGCA 


AGTACCAGCG 


CCTCAGCTTC 


AGCAAGCACC 


AGTGCGTCAC 


360 


CTCAGCAAGT 


ACCAGCGCCT 


CAGCCTCAGC 


AAGCACCAGT 


GCCTCAGCTT 


CAGCAAGTAC 


420 


CAGTGCGTCA 


CCTCAGCATC 


GACAAGTGCG 


TCGGCTTCAG 


CAAGTACCTC 


AGCGTCTGAA 


480 
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TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 

AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 451 base pairs 
(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

CGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTCAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 42 0 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 
TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 

CCAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCCT CAACGAGTGC GTCTGAGTCA GCATCAACCA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 420 
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ACAAG 

(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 88: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTTCTTTT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 3 60 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTtJG'KC'i'CCA AAAAAkAOCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 5'' 2 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAAcAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 360 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA . 480 



GCGCCTCAGC TTCAGCAAGC ACCGG 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



505 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 300 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 360 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 
(2) INFORMATION FOR SEQ ID NO: 3 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

AGCACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 420 
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AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 4 80 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a diird party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office, 
without havixig been laid open to public iJuspectiOu, the furnishing of a sample shall only be 
effected to an expert in the aft. The request to this effect shall be filed by the applicant with the 
Intemational Bureau before the expiration of 16 months firom the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been fded by the applicant, any request 
made by a third party for the fiimishing of a sample shall indicate the expert to be used.^ That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The. applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
Intemational Bureau before the completion of the technical preparations for the Intemational 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be fiimished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of tiie Patents Act of tiie Kingdom of the Netherlands, 
whichever two dates occurs earlier. 
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SINGAPORE 



The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for mtemational 
publication of the application. 



NORWAY 



The appUcant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the fiimishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the apphcant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act If such a request has been 
fUed by the applicant, any request made by a third party for the fiimishing of a s^ple shall 
indicate the expert to be used. That expert may be any person entered on a list of recogmzed 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant m the 



JUIUJ, ViU.U£U. 



AUSTRALIA 



The applicant hereby gives notice that the fiimishing of a sample of a imcroorgamsm shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refiisal or withdrawal of the 
application, to a person who is a skilled addressee without an interest m the mvention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to P^bUc inje^^^^ 
(by the National Board of Patents and Registration), or has been finally decided upon by Ae 
National Board of Patents and Registration without having been laid open to public mspection, 
the fiimishing of a sample shall only be effected to an expert m the art. 

ICELAND 

The appUcant hereby requests that, until the application has been laid open to vf^^^P^^'^^^ 
(by the'lcelandic PaLt Office), or has been finally decided upon by the ^f^^ 
without having been laid open to public inspection, the fimushmg of a sample shall only be 
effected in the art. 
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What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
IDNOS:l-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1 -391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (KOM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391 ; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS:l-391. a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
uegeneraie variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5' to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 

95 to any one of the fragments of the Streptococcus pneumoniae genome depicted in 

SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 

100 Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 

comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

105 b) identifying members of said library which contain sequences that 

hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 

no 

16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

115 a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

120 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 



125. 



18. An isolated polynucleotide molecule encoding any one of the 
polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 
pnewmaniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under 
conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 
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1. Claims: 1-7 

Computer readable medium having recorded thereon the 
nucleotide sequence depicted in SEQ ID nos* 1-391, a 
representative fragment thereof or a nucleotide sequence at 
least 95% identical to a nucleotide sequence depicted in SEQ 
ID nos. 1-391; a computer-based system for identifying 
fragments of the Streptococcus pneumoniae genome of 
coimiercial importance comprising: a) a data storage means 
comprising said nucleotide sequence(s); b) search means for 
comparing a target sequence to the nucleotide sequence of 
the data storage means of step (a) to identify homologous 
sequence(s), and c) retrieval means for obtaining said 
homologous sequence(s) of step (b); a method for identifying 
coimiercial ly important nucleic acid fragments of the 
Streptococcus pneumoniae genome comprising the step of 
comparing a database comprising said nucleotide sequence(s) 
with a target sequence to obtain a nucleic acid molecule 
comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not 
randomly selected; a method for identifying an expression 
modulating fragments of the Streptococcus pneumoniae genome 
comprising the step of comparing a database comprising said 
nucleotide sequence(s) with a target sequence to^obtain a 
nucleic acid molecule corrprised of a coiiiHlcm£ntary 
nucleotide sequence to said target sequence, wherein said 
target sequence comprises sequences known to regulate gene 
expression; 

2. Claims: (8-20) partially 

An isolated protein-encoded nucleic acid fragment of the 
Streptococcus pneumoniae genome, wherein said fragment 
consists of the nucleotide sequence of the fragment of 
SEQ ID no.l depicted in Tables 2 and 3, or a degenerate 
variant thereof; a vector comprising the fragment of the 
Streptococcus ^Dneumoniae genome SEQ ID no.l; an isolated 
fragment of the Streptococcus pneumoniae genome, wherein 
said fragment modulates the expression of an operably linked 
open reading frame, wherein said fragment consists of the 
nucleotide sequence from about 10 to 200 bases in le^Sth 
which is 5' to any one of the open reading frame of SEQ ID 
no.l depicted in Tables 2 and 3 or a degenerate variant 
thereof; a method for regulating the expression of a nucleic 
acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule 
consisting of the nucleotide sequence from about 10 to 100 
bases 5' to any one of the open reading frame of SEQ ID 
no.l and Tables 2 and 3 or a degenerate variant thereof; an 
isolated nucleic acid molecule encoding a homolog of SEQ 
ID no.l; an isolated polypeptide encoded by SEQ ID no.l and 
depicted-.in Table 2 and 3; an antibody which selectively 
binds to any one of said polypeptides, a method for 
producing a polypeptide* in a host cell comprising a) 
incubating a host containing a heterologous nucleic acid 
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molecule whose nucleotide sequence consists of 5EQ ID no.l 
and depicted in Table 2 and 3. under conditions where said 
heterologous nucleic acid molecule is expressed to produce 
said protein, and b) isolating said protein; 

3-392. Claims: (8-20) partially 

Idem as subject 2 but limited to each of the sequences of 
SEQ ID no. 2 to 391; 

For the sake of conciseness, the second subject na^er i s 
explicitly defined, the other subject matters are defined by 
analogy hereto. 
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1, Claims: 1-7 

Computer readable medium having recorded thereon the 
nucleotide sequence depicted in SEQ ID nos. 1-391, a 
representative fragment thereof or a nucleotide sequence at 
least 95% identical to a nucleotide sequence depicted in 5tQ 
ID nos. 1-391; a computer-based system for identifying 
fragments of the Streptococcus pneumoniae genome of 
coimiercial importance comprising: a) a data storage means 
comprising said nucleotide sequence(s); b) search means tor 
comparing a target sequence to the nucleotide sequence ot 
the data storage means of step (a) to identify homologous 
sequence(s), and c) retrieval means for obtaining said 
homologous sequence(s) of step (b); a method for identifying 
conmercially important nucleic acid fragments of the 
Streptococcus pneumoniae genome comprising the step of 
comparing a database comprising said nucleotide sequencers ) 
with a target sequence to obtain a nucleic acid molecule 
comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not 
randomly selected; a method for identifying an expression 
modulating fragments of the Streptococcus pneumoniae genome 
comprising the step of comparing a database comprising saia 
nucleotide sequence{s) with a target sequence to obtain a 
nucieic aciQ moiecuie convt iscu ui a uunin j . 

nucleotide sequence to said target sequence, wherein saia 
target sequence comprises sequences known to regulate gene 
expression; 

2. Claims: (8-20) partially 

An isolated protein-encoded nucleic acid fragment of the 
Streptococcus pneumoniae genome, wherein said fragment 
consists of the nucleotide sequence of the fragment of 
SEQ ID no.l depicted in Tables 2 and 3, or a degenerate 
variant thereof; a vector comprising the fragment of the 
Streptococcus pneumoniae genome SEQ ID no.l; an isolated 
fragment of the Streptococcus pneumoniae genome, wherein 
said fragment modulates the expression of an operably linKea 
open reading frame, wherein said fragment consists of tne 
nucleotide sequence from about 10 to 200 bases in lenfth 
which is 5' to any one of the open reading frame of SEQ ID 
no.l depicted in Tables 2 and 3 or a degenerate variant . 
thereof; a method for regulating the expression of a nucleic 
acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule 
consisting of the nucleotide sequence from about 10 to 
bases 5' to any one of the open reading frame of SEQ ID 
no.l and Tables 2 and 3 or a degenerate variant thereof; an 
isolated nucleic acid molecule encoding a homolog of SEQ 
ID no.l; an isolated polypeptide encoded by SEQ ID no.l and 
depicted. in Table 2 and 3; an antibody which selectively 
binds to any one of said polypeptides, a method for 
producing a polypeptide" in a host cell comprising a) 
incubating a host containing a heterologous nucleic acid 
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molecule whose nucleotide sequence consists of SEQ ID no.l 
and depicted in Table 2 and 3. under conditions where said 
heterologous nucleic acid molecule is expressed to produce 
said protein, and b) isolating said protein; 



3-392. Claims: (8-20) partially 

Idem as subject 2 but limited to each of the sequences of 
SEQ ID no. 2 to 391; 

For the sake of conciseness, the second subject nia"er i s 
explicitly defined, the other subject matters are defined by 
analogy hereto. 
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1. Claims: 1-7 



Computer readable medium having recorded thereon the 
nucleotide sequence depicted in SEQ ID nos» 1-391, a 
representative fragment thereof or a nucleotide sequence at 
least 95% identical to a nucleotide sequence depicted in 5EQ 
ID nos. 1-391; a computer-based system for identifying 
fragments of the Streptococcus pneumoniae genome of 
conmercial importance comprising: a) a data storage means 
comprising said nucleotide sequence(s); b) search means tor 
comparing a target sequence to the nucleotide sequence ot 
the data storage means of step (a) to identify homologous 
sequence(s), and c) retrieval means for obtaining said 
homologous sequence(s) of step (b); a method for identifying 
coimerciany important nucleic acid fragments of the 
Streptococcus pneumoniae genome comprising the step of 
comparing a database comprising said nucleotide sequence(s) 
with a target sequence to obtain a nucleic acid molecule 
comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not 
randomly selected; a method for identifying an expression 
modulating fragments of the Streptococcus pneumoniae genome 
comprising the step of comparing a database comprising said 
nucleotide sequence(s) with a target sequence to obtain a 

1 ampn tarv 
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nucleotide sequence to said target sequence, wherein saia 
target sequence comprises sequences known to regulate gene 
expression; 



2. Claims: (8-20) partially 

An isolated protein-encoded nucleic acid fragment of the 
Streptococcus pneumoniae genome, wherein said fragment 
consists of the nucleotide sequence of the fragment of 
SEQ ID no.l depicted in Tables 2 and 3, or a degenerate 
variant thereof; a vector comprising the fragment of the 
Streptococcus pneumoniae genome SEQ ID no.l; an isolated 
fragment of the Streptococcus pneumoniae genome, wherein 
said fragment modulates the expression of an operably jinked 
open reading frame, wherein said fragment consists of the 
nucleotide sequence from about 10 to 200 bases in lenQ^n 
which is 5' to any one of the open reading frame of SEQ ID 
no.l depicted in Tables 2 and 3 or a degenerate variant . 
thereof; a method for regulating the expression of a nucleic 
acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule 
consisting of the nucleotide sequence from about IQ to 100 
bases 5' to any one of the open reading frame of SEQ ID 
no.l and Tables 2 and 3 or a degenerate variant thereof; an 
isolated nucleic acid molecule encoding a homolog of SEQ 
ID no.l; an isolated polypeptide encoded by SEQ ID no.l and 
depicted. in Table 2 and 3; an antibody which selectively 
binds to any one of said polypeptides, a method for 
producing a polypeptide' in a host cell comprising a) 
incubating- a host containing a heterologous nucleic acid 



